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COMPOSITIONS AND METHODS FOR HYDROXYLATTNG 

EPOTHELONES 

5 

Field of the Invention 

The present invention relates to isolated nucleic acids sequences and 
polypeptides encoded thereby for epothilone B hydroxylase and mutants and variants 
thereof, and a ferredoxin located downstream from the epothilone B hydroxylase 

10 gene. The present invention also relates to recombinant microorganisms expressing 
epothilone B hydroxylase or a mutant or variant thereof and/or ferredoxin which are 
capable of hydroxylating small organic molecule compounds, such as epothilones, 
having a terminal alkyl group to produce compounds having a terminal hydroxyalkyl 
group. Also provided are methods for recombinantly producing such microorganisms 

15 as well as methods for using these recombinant microorganisms in the synthesis of 
compounds having a terminal hydroxylalkyl group. The compositions and methods 
of the present invention are useful in preparation of epothilones having a variety of 
utilities in the pharmaceutical field. A novel epothilone analog produced using a 
mutant of epothilone B hydroxylase of the present invention is also described. 

20 

Background of the Invention 

Epothilones are macrolide compounds that find utility in the pharmaceutical 
field. For example, epothilones A and B having the structures: 

PH 

Me 

25 Epothilone A R=H 

Epothilone B R=Me 

have been found to exert microtubule-stabilizing effects similar to paclitaxel 
(TAXOL®) and hence cytotoxic activity against rapidly proliferating cells, such as, 
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tumor cells or cells associated with other hyperproliferative cellular diseases, see 
Bollag et al., Cancer Res .. VoL 55, No. 11, 2325-2333 (1995). 

Epothilones A and B are natural anticancer agents produced by Sorangium 
cellulosum that were first isolated and characterized by Hofle et <zl., DE 4138042; WO 
5 93/10121; Angew. Chem. Int. Ed. Engl. Vol. 35, Nol3/14, 1567-1569 (1996); and L 
Antibiot, VoL 49, No. 6, 560-563 (1996). Subsequently, the total syntheses of 
epothilones A and B have been published by Balog et al 9 Angew. Chem. Int Ed. 
Engl., Vol. 35, No. 23/24, 2801-2803, 1996; Meng et a/., J. Am. Chem. Soc . Vol. 
119, No. 42, 10073-10092 (1997); Nicolaou etal. 9 J. Am. Chem. Soc .. Vol. 119, No. 

10 34, 7974-7991 (1997); Schinzer et cL % Angew. Chem. Int. Ed. Eng .. Vol. 36, No. 5, 
523-524 (1997); and Yang et aJ. 9 Angew. Chem. Int Ed. Engl.. Vol. 36, No. 1 / 2, 
166-168, 1997. WO 98/25929 disclosed the methods for chemical synthesis of 
epothilone A, epothilone B, analogs of epothilone and libraries of epothilone analogs. 
The structure and production from Sorangium cellulosum DSM 6773 of epothilones 

15 C, D, E, and F was disclosed in WO 98/22461. Figure 1 provides a diagram of the 
biotransformation as described in WO 00/39276 of epothilone B to epothilone F in 
Actinomycetes species strain SC15847 (ATCC PT-1043), subsequently identified as 
Artiycolatopsis orientcUis. 

Cytochrome P450 enzymes are found in prokaryotes and eukaryotic cells and 

20 have in common a heme binding domain which can be distinguished by an 

absorbance peak at 450 nm when complexed with carbon monoxide. Cytochrome 
P450 enzymes perform a broad spectrum of oxidative reactions on primarily 
hydrophobic substrates including aromatic and benzylic rings, and alkanes. In 
prokaryotes they are found as detoxifying systems and as a first enzymatic step in 

25 metabolizing substrates such as toluene, benzene and camphor. Cytochrome P450 

genes have also been found in biosynthetic pathways of secondary metabolites such as 
nikkomycin in Streptomyces tendae (Bruntner, C. et al, 1999, Mol. Gen. Genet. 262: 
102-114), doxorubicin (Dickens, M.L, Strohl, WJL, 1996, J. Bacteriol, 178: 3389- 
3395) and in the epothilone biosynthetic cluster of Sorangium cellulosum (Julien, B. 

30 et al., 2000, Gene, 249: 153-160). With a few exceptions, the cytochrome P450 
systems in prokaryotes are composed of three proteins; a ferredoxin NADH or 
NADPH dependent reductase, an iron-sulfur ferredoxin and the cytochrome P450 
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enzyme (Lewis, D.F., fflavica, P., 2000, BiochinL Biophys. Acta., 1460: 353-374). 
Electrons are transferred from ferredoxin reductase to the ferredoxin and finally to the 
cytochrome P450 enzyme for the splitting of molecular oxygen. 

* 

5 Summary of the Invention 

An object of the present invention is to provide isolated nucleic acid sequences 
encoding epothilone B hydroxylase and variants or mutants thereof and isolated 
nucleic acid sequences encoding ferredoxin or variants or mutants thereof. 

Another object of the present invention is to provide isolated polypeptides 
10 comprising amino acid sequences of epothilone B hydroxylase and variants or 
mutants thereof and isolated polypeptides comprising amino acid sequences of 
ferredoxin and variants or mutants thereof. 

Another object of the present invention is to provide structure coordinates of 
the homology model of the epothilone B hydroxylase. The structure coordinates are 
15 listed in Appendix 1 . This model of the present invention provides a means for 

designing modulators of a biological function of epothilone B hydroxylase as well as 
additional mutants of epothilone B hydroxylase with altered specificities. 

Another object of the present invention is to provide vectors comprising 
nucleic acid sequences encoding epothilone B hydroxylase or a variant or mutant 
20 thereof and/or ferredoxin or a variant or mutant thereof. In a preferred embodiment, 
these vectors further comprise a nucleic acid sequence encoding ferredoxin. 

Another object of the present invention is to provide host cells comprising a 
vector containing a nucleic acid sequence encoding epothilone B hydroxylase or a 
variant or mutant thereof and/or ferredoxin or a variant or mutant thereof. 
25 Another object of the present invention is to provide a method for producing 

recombinant microorganisms that are capable of hydroxylating compounds, and in 
particular epothilones, having a terminal alkyl group to produce compounds having a 
terminal hydroxyalkyl group. 

Another object of the present invention is to provide microorganisms produced 
30 recombinandy which are capable of hydroxylating compounds, and in particular 
epothilones, having a terminal alkyl group to produce compounds having a terminal 
hydroxyalkyl group. 
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Another object of the present invention is to provide methods for 
hydroxylating compounds in these recombinant microorganisms. In particular, the 
present invention provides a method for the preparation of hydroxyalkyl-bearing 
epothilones, which compounds find utility as antitumor agents and as starting 
materials in the preparation of other epothilone analogs. 

Yet another object of the present invention is to provide a compound of 
Formula A: 

s °- 

N 




10 referred to herein as 24-OH epothilone B or 24-OH EpoB, as well as compositions 
and methods for production of compositions comprising the compound of Formula A. 



Brief Description of the Figures 

Figure 1 provides a schematic of the biotransformation as set forth in WO 
15 00/39276, U.S. Application Serial No. 09/468,854, filed December 21, 1999, of 

epothilone B to epothilone F by Amycolatopsis orientalis strain SC15847 (PTA1043). 
Figure 2 shows the nucleic acid sequence alignments of SEQ ID NO:5 through 

SEQ ID NO:22 used to design the PCR primers for cloning of the nucleic acid 

sequence encoding epothilone B hydroxylase. 
20 Figure 3 shows the sequence alignment between epothilone B hydroxylase 

(SEQ ID NO:2) and EryF (PDB code 1JIN chain A; SEQ ID NO:76). The asterisks 

indicate sequence identities, the colons (:) similar residues. 

Figure 4 provides a homology model of epothilone B hydroxylase based upon 

sequence alignment with EryF as shown in Figure 3. 
25 Figure 5 shows an energy plot of the epothilone B hydroxylase model 

(indicated by dashed line) relative to EryF (PDB code 1 JIN; indicated by solid line). 

An averaging window size of 51 residues was used, i.e., the energy at a given residue 
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position is calculated as the average of the energies of the 5 1 residues in the sequence 
that lie with the given residue at the central positions. 

Detailed Description of the Invention 



The present invention relates to isolated nucleic acid sequences and 
polypeptides and methods for obtaining compounds with desired substituents at a 
terminal carbon position. In particular, the present invention provides compositions 
and methods for the preparation of hydroxyalkyl-bearing epothilones, which 

10 compounds find utility as antitumor agents and as starting materials in the preparation 
of other epothilone analogs. 

The term "epothilone," as used herein, denotes compounds containing an 
epothilone core and a side chain group as defined herein. The term "epothilone core," 
as used herein, denotes a moiety containing the core structure (with the numbering of 

15 ring system positions used herein shown): 



5 




wherein the substituents are as follows: 



Q is selected from the group consisting of 



20 




and 



WisOorNR*; 

X is selected from the group consisting of O, H and OR7; 
Mis O, S, NR8,CR 9 Rio; 

Bi and B 2 are selected from the group consisting of ORn, OCOR12; 
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R1-R5 and R12-R17 are selected from the group consisting of H, alkyl, 
substituted alkyl, aryl, and heterocyclo, and wherein Ri and R2 are alkyl they can be 
joined to form a cycloalkyl; 

R$ is selected from the group consisting of H, alkyl, and substituted alkyl; 
5 R 7 and Rn are selected from the group consisting of H, alkyl, substituted 

alkyl, trialkylsilyl, alkyldiarylsilyl and dialkylarylsilyl; 

R« is selected from the group consisting of H, alkyl, substituted alkyl, R13OO, 
R14OOO and R15SO2; and 

R 9 and Rio are selected from the group consisting of H, halogen, alkyl, 
10 substituted alkyl, aryl, heterocyclo, hydroxy, Ri 6 C=0, and RnOC=0. 

The term "side chain group" refers to substituent G as defined above for 
Epothilone A or B or Gi and G 2 as shown below. 
Gi is the following formula V 

HO-CHHAl)n-(Q)m"(A2)o (V), 

15 and 

G2 is the following formula VI 

CH 3 -(Ai)„-(Q) m -(A 2 ) 0 (VI), 

where 

Ai and A2 are independently selected from the group of optionally substituted 
20 C1-C3 alkyl and alkenyl; 

Q is optionally substituted ring system containing one to three rings and at 
least one carbon to carbon double bond in at least one ring; and 

n, m, and o are integers independently selected from the group consisting of 
zero and 1, where at least one of m, n or o is 1. 
25 The term "terminal carbon" or "terminal alkyl group*' refers to the terminal 

carbon or terminal methyl group of the moiety either directly bonded to the epothilone 
core at position 15 or to the terminal carbon or terminal alkyl group of the side chain 
group bonded at position 15. It is understood that the term "alkyl group" includes 
alkyl and substituted alkyl as defined herein. 
30 The term "alkyl" refers to optionally substituted, straight or branched chain 

saturated hydrocarbon groups of 1 to 20 carbon atoms, preferably 1 to 7 carbon atoms. 
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The expression 'lower alkyl" refers to optionally substituted alkyl groups of 1 to 4 
carbon atoms. 

The term "substituted alkyl" refers to an alkyl group substituted by, for 
example, one to four substituents, such as, halo, trifluoromethyl, trifluoromethoxy, 

5 hydroxy, alkoxy, cycloalkyloxy, heterocyclooxy, oxo, alkanoyl, aryloxy, alkanoyloxy, 
amino, alkylamino, arylamino, aralkylamino, cycloalkylamino, heterocycloamino, 
disubstituted amines in which the 2 amino substituents are selected from alkyl, aryl or 
aralkyl, alkanoylamino, aroylamino, aralkanoylamino, substituted alkanoylamino, 
substituted arylamino, substituted aralkanoylamino, thiol, alkylthio, arylthio, 

10 aralkylthio, cycloalkylthio, heterocyclothio, alkylthiono, arylthiono, aralkylthiono, 
alkylsulfonyl, arylsulfonyl, aralkylsulfonyl, sulfonamido (e.g. SO2NH2), substituted 
sulfonamido, nitro, cyano, carboxy, carbamyl (e.g. CONH2), substituted carbamyl 
(e.g. CONH alkyl, CONH aryl, CONH aralkyl or cases where there are two 
substituents on the nitrogen selected from alkyl, aryl or aralkyl), alkoxycarbonyl, aryl, 

15 substituted aryl, guanidino and heterocyclos, such as, indolyl, imidazolyl, furyl, 
thienyl, thiazolyl, pyrrolidyl, pyridyl, pyrimidyl and the like. Where noted above 
where the substituent is further substituted it will be with halogen, alkyl, alkoxy, aryl 
or aralkyl. 

In accordance with one aspect of the present invention there are provided 
20 isolated polynucleotides that encode epothilone B hydroxylase, an enzyme capable of 
hydroxylating epothilones having a terminal alkyl group to produce epothilones 
having a terminal hydroxyalkyl group. 

In accordance with another aspect of the present invention there are provided 
isolated polynucleotides that encode a ferredoxin, the gene for which is located 
25 downstream from the epothilone B hydroxylase gene. Ferredoxin is a protein of the 
cytochrome P450 system. 

By "polynucleotides", as used herein, it is meant to include any form of DNA 
or RNA such as cDNA or genomic DNA or mRNA, respectively, encoding these 
enzymes or an active fragment thereof which are obtained by cloning or produced 
30 synthetically by well known chemical techniques. DNA may be double- or single- 
stranded. Single-stranded DNA may comprise the coding or sense strand or the non- 
coding or antisense strand. Thus, the term polynucleotide also includes 
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polynucleotides exhibiting at least 60% or more, preferably at least 80%, homology to 
sequences disclosed herein, and which hybridize under stringent conditions to the 
above-described polynucleotides. As used herein, the term "stringent conditions" 
means hybridization conditions of 60°C at 2xSSC buffer. More preferred are isolated 
5 nucleic acid molecules capable of hybridizing to the nucleic acid sequence set forth in 
1, 30, 32, 34, 36, 37, 38, 39, 40, 41, 42, 60, 62, 64, 66, 68, 70, 72, or 74 or SEQ ID 
NO:3, or to the complementary sequence of the nucleic acid sequence set forth in 
SEQ ID NO:l, 30, 32, 34, 36, 37, 38, 39, 40, 41, 42, 60, 62 ,64, 66, 68, 70, 72 ,or 74 
or SEQ ID NO:3, under hybridization conditions of 3X SSC at 65°C for 16 hours, 

10 and which are capable of remaining hybridized to the nucleic acid sequence set forth 
in SEQ ID NO:l, 30, 32, 34, 36, 37, 38, 39, 40, 41, 42, 60, 62, 64, 66, 68, 70, 72 or 74 
or SEQ ID NO:3, or to the complementary sequence of the nucleic acid sequence set 
forth in SEQ ID NO: 1 , 30, 32, 34, 36, 37, 38, 39, 40, 41 or 42, 60, 62, 64, 66, 68, 70, 
72 or 74 or SEQ ID NO:3, under wash conditions of 0.5X SSC, 55°C for 30 minutes. 

15 la one embodiment, a polynucleotide of the present invention comprises the 

genomic DNA depicted in SEQ ID NO:l or a homologous sequence or fragment 
thereof which encodes a polypeptide having similar activity to that of this epothilone 
B hydroxylase. Alternatively, a polynucleotide of the present invention may comprise 
the genomic DNA depicted in SEQ ID NO:3 or a homologous sequence or fragment 

20 thereof which encodes a polypeptide having similar activity to this ferredoxin. Due to 
the degeneracy of the genetic code, polynucleotides of the present invention may also 
comprise other nucleic acid sequences encoding this enzyme and derivatives, variants 
or active fragments thereof. 

The present invention also relates to variants of these polynucleotides which 

25 may be naturally occurring, i.e., present in microorganisms such as Amycolatopsis 
orientalis and Amycolata autotrophica, or in soil or other sources from which nucleic 
acids can be isolated, or mutants prepared by well known mutagenesis techniques. 
Exemplary variants polynucleotides of the present invention are depicted in SEQ ID 
NO: 36-42. 

30 By "mutants" as used herein it is meant to be inclusive of nucleic acid 

sequences with one or more point mutations, or deletions or additions of nucleic acids 
as compared to SEQ ID NO: 1 or 3, but which still encode a polypeptide or fragment 
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with similar activity to the polypeptides encoded by SEQ ID NO: 1 or 3. Ia a 
preferred embodiment, mutations ate made which alter the substrate specificity and/or 
yield of the enzyme. A preferred region of mutation with respect to the epothilone B 
hydroxylase gene is that region of the nucleic acid sequence coding for the 
5 approximately 113 amino acids residues comprising the active site of the enzyme. 
Also preferred are mutants encoding a polypeptide with at least one amino acid 
substitution at amino acid position GLU31, ARG67, ARG88, BLE92, ALA93, 
VAL106, HJB130, ALA140, MET176, PHE190, GLU 231, SER294, PHE237, or 
LLE365 of SEQ ID NO:l. Exemplary polynucleotide mutants of the present invention 
10 are depicted in SEQ ID NO: 30, 32, 34, 60, 62, 64, 66, 68, 70, 72 and 74. 

Cloning of the nucleic acid sequence of SEQ ID NO:l encoding epothilone B 
hydroxylase was performed using PCR primers designed by aligning the nucleic acid 
sequences of six cytochrome P450 genes from bacteria The following cytochrome 
P450 genes were aligned: 
15 Sequence 1: Locus: STMSUACB; Accession number M32238; Reference: 
Omer, C.A, J. Bacteriol. 172: 3335-3345 (1990) 
Sequence 2: Locus: STMSUBCB; Accession number: M32239; Reference: 

Omer, C.A., J. Bacteriol. 172: 3335-3345 (1990) 
Sequence 3: Locus: AB018074 (formerly STMORFA); Accession number: 
20 AB018074; Reference: Ueda, K., J. Antibiot 48: 638-646 (1995) 

Sequence 4: Locus: SSU65940; Accession number: U65940; Reference: 

Motamedi, H., J. Bacteriol. 178: 5243-5248 (1996) 
Sequence 5: Locus: STMOLEP; Accession number L37200; Reference: 

Rodriguez, AM., FEMS Microbiol. Lett 127: 117-120 (1995) 
25 Sequence 6: Locus: SERCP450A; Accession number: M83 1 10; Reference: 
Andersen, J.F. and Hutchinson, C.R., J. Bacteriol. 174: 725-735 
(1992) 

Alignments were performed using an implementation of the algorithm of 
Myers, E.W. and W. Miller. 1988. CABIOS 4:1, 11-17., the Align program from 
30 Scientific and Educational Software (Durham, North Carolina, USA). Three highly . 
conserved regions were identified in the I-helix, containing the oxygen binding 
domain, in the K-helix, and spanning the B-bulge and L-helix containing the 
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conserved heme binding domain. Primers were designed to the three conserved 
regions identified in the alignment Primers P450-1* (SEQ ID NO:23) and P4504a + 
(SEQ ID NO:24) were designed from the I helix, Primer P450-2+ (SEQ ID NO:25) 
was designed from the B-Bulge and L-helix region and Primer P450-3~( SEQ ID 
5 NO:27) was designed as the reverse complement to the heme binding protein. 

Genomic fragments were then amplified via polymerase chain reaction (PCR). 
After PCR amplification, the reaction products were separated by gel electrophoresis 
and fragments of the expected size were excised. The DNA was extracted from the 
agarose gel slices using the Qiaquick gel extraction procedure (Qiagen, Santa Clarita, 

10 California, USA). The fragments were then cloned into the PCRscript vector 
(Stratagene, La Jolla, California, USA) using the PCRscript Amp cloning kit 
(Stratagene). Colonies containing inserts were picked to 1-2 ml of LB broth with 100 
\ig/ml ampicillin, 30-37°C, 16-24 hours, 230-300 rpm. Plasmid isolation was 
performed using the Mo Bio miniplasmid prep kit (Mo Bio, Solano Beach, California, 

15 USA). This plasmid DNA was used as a PCR and sequencing template and for 
restriction digest analysis. 

The cloned PCR products were sequenced using the Big-Dye sequencing kit 
from Applied Biosystems, (Foster City, California, USA) and were analyzed using the 
ABB 10 sequencer (Applied Biosystems, Foster City, California, USA). The sequence 

20 of the inserts was used to perform a TblastX search, using the protocol of Altschul, 
Si 7 , et al., Mol. Biol. 215:403-410 (1990), of the non-redundant protein database. 
Unique sequences having a significant similarity to known cytochrome P450 proteins 
were retained. Using this approach, a total of nine different P450 sequences were 
identified from SCI 5847, seven from the genomic DNA template and two from the 

25 cDNA. Two P450 sequences were found in common between the DNA and cDNA 
templates. Of the fifty cDNA clones analyzed, two sequences were predominant, 
with twenty clones each. These two genes were then cloned from the genomic DNA. 

The nucleic acid sequence of the genomic DNA was determined using the 
Big-Dye sequencing system (Applied Biosystems) and analyzed using an ABB 10 

30 sequencer. This sequence is depicted in SEQ ID NO: 1. An open reading frame 

coding for a protein of 404 amino acids and a predicted molecular weight of 44.7 kDa 
was found within the cloned Bgin fragment The deduced amino acid sequence of 
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ttiis polypeptide is depicted in SEQ ID NO: 2. The amino acid sequence of this 
polypeptide was found to share 51% identity with the NifcF protein of Streptomyces 
tendae (Bruntner, C et al, 1999, Mol. Gen. Genet 262: 102-1 14) and 48% identity 
with the Sca-2 protein of S. carbophilus (Watanabe, I. Et al, 1995, Gene 163: 81-85). 
5 Both of these enzymes belong to the cytochrome P450 family 105. The invariable 
cysteine found in the heme-binding domain of all cytochrome P450 enzymes is found 
at residue 356. This gene for epothilone B hydroxylase has been named ebh. The 
ATG start codon of a putative ferredoxin gene of 64 amino acids is found nine 
basepairs downstream from the stop codon of ebh. This enzyme was found to share 
10 50% identity with ferredoxin genes of S. griseoulus (O'Keefe, DP., et al, 1991, 
Biochemistry 30: 447-455) and S. noursei (Brautaset, T., et al, 2000, Chem. Biol. 7: 
395-403). The nucleic acid sequence encoding this ferredoxin is depicted in SEQ ID 
NO:3 and the amino acid sequence for this ferredoxin polypeptide is depicted in SEQ 
IDNO:4. 

15 The ebh gene sequence was also used to isolate variant cytochrome P450 

genes from other microorganisms. Exemplary variant polynucleotides *Wi43491, 
*&M4930, e6ft53630, eM53550, ebh39444, eM43333 and e&/*35165 of the present 
invention and the species from which they were isolated are depicted in Table 1 
below. The nucleic acid sequences for these variants are depicted in SEQ ID NO:36- 

20 42, respectively. 

Table 1: Variant polynucleotides 



ATCCID 


Species 


ebh gene designation 


43491 


Amycolatopsis orientalis 


eM43491 


14930 


Amycolatopsis orientaUs 


«M14930 


53630 


Amycolatopsis orientaUs 


ebhS3630 


53550 


Amycolatopsis orientaUs 


ebhS35S0 


39444 


Amycolatopsis orientaUs 


ebh39444 


43333 


Amycolatopsis orientaUs 


«Mt43333 


35165 


Amycolatopsis orientaUs 


ebh3S!65 



The amino acid sequences encoded by the exemplary variants ebh4349l, 
e&M4930, eM53630, ebh53550, ebh39444, eM43333 and ebh35l65 are depicted in 
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SEQ ID NO:43-49, respectively. Table 2 provides a summary of the amino acid 
substitutions of these exemplary variants. 
Table 2: Amino acid Substitutions 



Position 


ebh 


Substitution 


ebh variant 


100 


Gly 


Ser 


eWil4930, efc/i43333, eWi53550, eWt43491 


101 


Lys 


Arg 


eWil4930 


130 


lie 


Leu 


eMtl4930 


192 


Ser 


Gin 


eM14930 


224 


Ser 


Thr 


<?Wil4930, ebh43333, ebh53550, ebhA3A9l 


285 


lie 


Val 


eM14930, ebhA3333, eM53550, eMi43491 


69 


Ser 


Asn 


ebh\3333 


256 


Val 


Ala 


ebh43333, ebh53550, eM43491 


93 


Ala 


Ser 


eM53550 


326 


Asp 


Glu 


eW»53550,eM43491 


333 


Thr 


Ala 


eWt53550, ebhA3W\ 


133 


Leu 


Met 


eMt43491 


398 


His 


Arg 


eWt39444 



5 Mutations were also introduced into the coding region of the ebh gene to 

identify mutants with improved yield, and/or rate of bioconversion and/or altered 
substrate specificity. Exemplary mutant nucleic acid sequences of the present 
invention are depicted in SEQ ID NO:30, 32, 34, 60, 62, 64, 66, 68, 70, 72 and 74. 

The nucleic acid sequence of SEQ ID NO:30 encodes a mutant db/i25-l which 

10 exhibits altered substrate specificity. Plasmid pANT849eWz25-l containing this 
mutant gene was deposited and accepted by an International Depository Authority 
under the provisions of the Budapest Treaty. The deposit was made on November 21, 
2p02 to the American Type Culture Collection at 10801 University Boulevard in 
Manassas, Virginia 201 10-2209. The ATCC Accession Number is PTA4809. All 

15 restrictions upon public access to this plasmid will be irrevocably removed upon 
granting of this patent application. The Deposit will be maintained in a public 
depository for a period of thirty years after the date of deposit or five years after the 
last request for a sample or for the enforceable life of the patent, whichever is longer. 
The above-referenced plasmid was viable at the time of the deposit The deposit will 

20 be replaced if viable samples cannot be dispensed by the depository. 

This S. lividans transformant identified in the screening of mutation 25 
(primers NPB29-mut25f (SEQ ID NO:58) and NPB29-mut25r (SEQ ID NO:59)) was 
found to produce a product with a different HPLC elution time than epothilone B or 
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epothilone R A sample of this unknown was analyzed by LC-MS and was found to 
have a molecular weight of 523 (M.W.), consistent with a single hydroxylation of 
epothilone B. Plasmid DNA was isolated from the S. lividans culture and used as a 
template for PCR amplification using primers NPB29-6f (SEQ ID NO:28) and 
5 NPB29-7r (SEQ ID NO:29) (see Example 17). The expected fragment was obtained 
and sequenced using the Big-Dye sequencing system. The ebK25A mutant was found 
to have two mutations resulting in changes in the amino acid sequence of the protein, 
asparagine 195 is changed to serine and serine 294 is changed to proline. The position 
targeted for mutation at codon 238 was found to have a two nucleotide change, which 
10 did not result in a change of the amino acid sequence of the protein. The amino acid 
sequence of the mutanf polypeptide encoded by SEQ ID NO:30 is depicted in SEQ ID 
NO:31. 

The nucleic acid sequence of SEQ ID NO:32 encodes a mutant efcMO-53, 
which exhibits improved bioconversion yield. This S. lividans transfonnant identified 

15 in the screening of mutation 10 (primers NPB29-mutl0f (SEQ ID NO:54) and 

NPB29-mutl0r (SEQ ID NO:55)) produced a greater yield of epothilone F. Plasmid 
DNA was isolated from the S. lividans culture and used as a template for PCR 
amplification using primers NPB29-6f (SEQ ID NO:28) and NPB29-7r (SEQ ID 
NO:29)(see Example 16). The expected fragment was obtained and sequenced using 

20 the Big-Dye sequencing system. The ebhl0~53 mutant was found to have two 

mutations resulting in changes in the amino acid sequence of the protein, glutamic 
acid 231 is changed to arginine and phenylalanine 190 is changed to tyrosine. The 
position 231 was the target of the mutagenesis, the change at residue 190 is an 
inadvertent change that is an artifact of the mutagenesis procedure. The amino acid 

25 sequence of the mutant polypeptide encoded by SEQ ID NO:32 is depicted in SEQ ID 
NO:33. 

The nucleic acid sequence of SEQ ID NO:34 encodes a mutant eM24-16, . 
which also exhibits improved bioconversion yield. This S. lividans transfonnant, 
ebh2A-l6 identified in the screening of mutation 24 (primers NPB29-mut24f (SEQ ID 
30 NO:56) and NPB29-mut24r (SEQ ID NO:57) also produced a greater yield of 
epothilone F. Plasmid DNA was isolated from the S. lividans culture and used as a 
template for PCR amplification using primers NPB29-6f (SEQ ID NO:28) and 
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NPB29-7r (SEQ ID NO:29). The expected fragment was obtained and sequenced 
using the Big-Dye sequencing system. The eWi24-16 mutant was found to have two 
mutations resulting in changes in the amino acid sequence of the protein, 
phenylalanine 237 is changed to alanine and isoleucine 92 is changed to valine. The 
5 position 237 was the target of the mutagenesis, the change at residue 92 is an 

inadvertent change that is an artifact of the mutagenesis procedure. The amino acid 
sequence of the mutant polypeptide encoded by SEQ ID NO:34 is depicted in SEQ ID 
NO:35. 

The nucleic acid sequence of SEQ ID NO:60 encodes a mutant e&ft24-16d8, 

10 which also exhibits improved bioconversion yield. This 5. rimosus transfonnant, 

ebh2A-16dS identified in the screening of mutation 59 (primer NPB29mut59 (SEQ ID 
NO:70)) also produced a greater yield of epothilone F. Plasmid DNA was isolated 
from the S. rimosus culture and used as a template for PCR amplification using 
primers NPB29-6f (SEQ ID NO:28) and NPB29-7r (SEQ ID NO:29). The expected 

15 fragment was obtained and sequenced using the Big-Dye sequencing system. The 
eW2A -16d8 mutant was found to have one mutation resulting in a change in the 
amino acid sequence of the protein, arginine 67 is changed to glutamine. This change 
is an artifact of the mutagenesis procedure. The amino acid sequence of the mutant 
polypeptide encoded by SEQ ID NO:60 is SEQ ID NO:61. 

20 The nucleic acid sequence of SEQ ID NO:62 encodes a mutant ebh2A-16cl 1, 

which also exhibits improved bioconversion yield. This S. rimosus transfonnant, 
eWz24-16cll identified in the screening of mutation 59 (primer NPB29mut59 (SEQ 
ID NO:70)) also produced a greater yield of epothilone F. Plasmid DNA was isolated 
from the S. rimosus culture and used as a template for PCR amplification using 

25 primers NPB29-6f (SEQ ID NO:28) and NPB29-7r (SEQ ID NO:29). The expected 
fragment was obtained and sequenced using the Big-Dye sequencing system. The 
e£/z24-16cll mutant was found to have two additional mutations resulting in changes 
in the amino acid sequence of the protein, alanine 93 is changed to glycine and 
isoleucine 365 is changed to threonine. The position 93 is the target of the 

30 mutagenesis, the change at 365 is an artifact of the mutagenesis procedure. The 
amino acid sequence of the mutant polypeptide encoded by SEQ ID NO:62 is 
depicted in SEQ ID NO:63. 
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The nucleic acid sequence of SEQ ID NO:64 encodes a mutant eWt24-16-16, 
which also exhibits improved bioconversion yield. This 5. rimosus transformant, 
ebK2A-\6-\6 identified in the screening of random mutants of ebh24-l6 also 
produced a greater yield of epothilone F. Plasmid DNA was isolated from the S. 
5 rimosus culture and used as a template for PCR amplification using primers NPB29- 
6f (SEQ ID NO:28) and NPB29-7r (SEQ ID NO:29). The expected fragment was 
obtained and sequenced using the Big-Dye sequencing system. The efc/i24-16-16 
mutant was found to have one additional mutation resulting in changes in the amino 
acid sequence of the protein, valine 106 is changed to alanine. The amino acid 
10 sequence of the mutant polypeptide encoded by SEQ ID NO:64 is depicted in SEQ ID 
NO:65. 

The nucleic acid sequence of SEQ ID NO:66 encodes a mutant *Wi24-16-74, 
which also exhibits improved bioconversion yield. This & rimosus transformant, 
ebh2A-l6-74 identified in the screening of random mutants of ebh24-16 also 

15 produced a greater yield of epothilone F. Plasmid DNA was isolated from the S. 

rimosus culture and used as a template for PCR amplification using primers NPB29- 
6f (SEQ ID NO:28) and NPB29-7r (SEQ ID NO:29). The expected fragment was 
obtained and sequenced using the Big-Dye sequencing system. The eWi24-16-74 
mutant was found to have one additional mutation resulting in changes in the amino 

20 acid sequence of the protein, arginine 88 is changed to histidine. The amino acid 
sequence of the mutant polypeptide encoded by SEQ ID NO:66 is SEQ ID NO:67. 
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The nucleic acid sequence of SEQ ID NO:68 encodes a mutant dWi24-M18, 
which also exhibits improved bioconversion yield This S. riniosus transformant, 
ebhMrlZ identified in the screening of random mutants of ebh also produced a 
greater yield of epothilone R Plasmid DNA was isolated from the S. rimosus culture 
5 and used as a template for PCR amplification using primers NPB29-6f (SEQ ID 
NO:28) and NPB29-7r (SEQ ID NO:29). The expected fragment was obtained and 
sequenced using the Big-Dye sequencing system. The cMM-18 mutant was found to 
have two mutations resulting in changes in the amino acid sequence of the protein, 
glutamic acid 31 is changed to lysine and methionine 176 is changed to valine. The 

10 amino acid sequence of the mutant polypeptide encoded by SEQ ID NO:68 is 
depicted in SEQ ID NO:69. 

The nucleic acid sequence of SEQ ID NO:72 encodes a mutant ebh2A-l6g%> 
which also exhibits improved bioconversion yield. This S. rimosus transformant, 
db/i24-16g8 identified in the screening of mutation 50 (primer NPB29mut50 (SEQ ID 

15 NO:71)) also produced a greater yield of epothilone F. Plasmid DNA was isolated 
from the S. rimosus culture and used as a template for PCR amplification using 
primers NPB29-6f (SEQ ID NO:28) and NPB29-7r (SEQ ID NO:29). The expected 
fragment was obtained and sequenced using the Big-Dye sequencing system. The 
£&7i24-16g8 mutant was found to have two additional mutations resulting in changes 

20 in the amino acid sequence of the protein, methionine 176 is changed to alanine and 
isoleucine 130 is changed to threonine. The position 176 is the target of the 
mutagenesis, the change at 130 is an artifact of the mutagenesis procedure. The 
amino acid sequence of the mutant polypeptide encoded by SEQ ID NO:72 is 
depicted in SEQ ID NO:73. 

25 The nucleic acid sequence of SEQ ID NO:74 encodes a mutant eta24-16b9, 

which also exhibits improved bioconversion yield. This S. rimosus transformant, 
ebh2A-l6b9 identified in the screening of mutation 50 (primer NPB29mut50 (SEQ ID 
NO:71)) also produced a greater yield of epothilone F. Plasmid DNA was isolated 
from the 5. rimosus culture and used as a template for PCR amplification using 

30 primers NPB29-6f (SEQ ID NO:28) and NPB29-7r (SEQ ID NO:29). The expected 
fragment was obtained and sequenced using the Big-Dye sequencing system- The 
e&/i24-16b9 mutant was found to have two additional mutations resulting in changes 
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in the amino acid sequence of the protein, methionine 176 is changed to serine and 
alanine 140 is changed to threonine* The position 176 is the target of the mutagenesis, 
the change at 140 is an artifact of the mutagenesis procedure. The amino acid 
sequence of the mutant polypeptide encoded by SEQ ID NO:74 is depicted in SEQ ID 
5 NO:75. 

A mixture composed of the plasmids pANT849eWi-24-16, pANT849eW*-10- 
53, pANT849eWi-24-16d8 > pANT849eMi-24-16cl 1, pANT849efc/i-24-16-16, 
pant849e&/i-24-16-74, pANT849eM-24-16b9, pANT849eta-M18 and pANT849<?M- 
24-16g8 for these nine mutant genes was deposited and accepted by an International 

10 Depository Authority under the provisions of the Budapest Treaty. The deposit was 
made on November 21, 2002 to the American Type Culture Collection at 10801 
University Boulevard in Manassas, Virginia 201 10-2209. The ATCC Accession 
Number is PTA4808. All restrictions upon public access to this mixture of plasmids 
will be irrevocably removed upon granting of this patent application. The deposit will 

15 be maintained in a public depository for a period of thirty years after the date of 
deposit or five years after the last request for a sample or for the enforceable life of 
the patent, whichever is longer. The above-referenced mixture of plasmids was viable 
at the time of the deposit The deposit will be replaced if viable samples cannot be 
dispensed by the depository. 

20 Thus, in accordance with another aspect of the present invention, there are 

provided isolated polypeptides of epothilone B hydroxylase and variants and mutants 
thereof and isolated polypeptides of ferredoxin or variants thereof. In one 
embodiment of the present invention, by "polypeptide" it is meant to include the 
amino acid sequence of SEQ ID NO: 2, and fragments or variants, which retain 

25 essentially the same biological activity and/or function as this epothilone B 

hydroxylase. In another embodiment of the present invention, by '•polypeptide" it is 
meant to include the amino acid sequence of SEQ ID NO:4, and fragments and/or 
variants, which retain essentially the same biological activity and/or function as this 
ferredoxin. 

30 By 'Variants" as used herein it is meant to include polypeptides with amino 

acid sequences with conservative amino acid substitutions as compared to SEQ ID 
NO: 2 or 4 which are demonstrated to exhibit similar biological activity and/or 
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function to SEQ ID NO:2 or 4. By "conservative amino acid substitutions" it is 
meant to include replacement, one for another, of the aliphatic amino acids such as 
Ala, Val, Leu and He, the hydroxy! residues Ser and Thr, the acidic residues Asp and 
Glu, and the amide residues Asn and Gin. Exemplary variant amino acid sequences 
5 of the present invention are depicted in SEQ ID NO:43-49 and the amino acid 
substitutions of these exemplary variants are described in Table 2, supra. 

By "mutants" as used herein it is meant to include polypeptides encoded by 
nucleic acid sequences with one or more point mutations, or deletions or additions of 
nucleic acids as compared to SEQ ID NO: 1 or 3, but which still have similar activity 

10 to the polypeptides encoded by SEQ ID NO: 1 or 3. In a preferred embodiment, 
mutations are made to the nucleic acid that alter the substrate specificity and/or yield 
from the polypeptide encoded thereby. A preferred region of mutation with respect to 
the epothilone B hydroxylase gene is that region of the nucleic acid sequence coding 
for the approximately 1 13 amino acid residues comprising the active site of the 

15 enzyme. Also preferred are mutants with at least one amino acid substitution at 
amino acid position GLU31, ARG67, ARG88, ILE92, ALA93, VAL106, DLE130, 
ALA140, MET176, PHE190, GLU 231, SER294, PHE237, or ILE365 of SEQ ID 
NO:l Exemplary mutants ebh25-l t eWilO-53, ebh2A-l6 9 eW*24-16d8, e&A24-16cll, 
e&A24-16-16, e&A24-16-74, eM24-16g8, <?M24-16b9 and the nucleic acid sequences 

20 encoding such mutants of the present invention are depicted in SEQ ID NO:3 1 , 33, 
35, 61, 63, 65, 67, 69, 71, 73 and 75, and SEQ ID NO:30, 32, 34, 60, 62, 64, 66, 68, 
70, 72 and 74, respectively. 

A 3-dimensional model of epothilone B hydroxylase has also been constructed 
in accordance with general teachings of Greer et al. (Comparative modeling of 

25 homologous proteins. Methods In Enzymology 202239-52, 1991), Lesk et al. 
(Homology Modeling: Inferences from Tables of Aligned Sequences. Curr. Op. 
Struc. Biol. (2) 242-247, 1992), and Cardozo et al. (Homology modeling by the ICM 
method. Proteins 23, 403-14, 1995) on the basis of the known structure of a 
homologous protein EryF (PDB Code 1KBST chain A). Homology between these 

30 sequences is 34%. Alignment of the sequences of epothilone B hydroxylase (SEQ ID 
NO:2) and EryF (PDB Code HON chain A; SEQ ID NO:76) is depicted in Figure 3. 
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A homology model of epothilone B hydroxylase based upon sequence alignment with 
EryF is depicted in Figure 4. 

An energy plot of the epothilone B hydroxylase model relative to EryF (PDB 
code 1 JIN) was also prepared and is depicted in Figure 5. An averaging window size 
5 of 5 1 residues was used at a given residue position to calculate the average of the 

energies of the 51 residues in the sequence that lie with the given residue at the central 
position. As shown in Figure 5, all energies along the sequence lie below zero thus 
indicating that the modeled structure as set forth in Figure 4 and Appendix 1 is 
reasonable. 

10 The three-dimensional structure represented in the homology model of 

epothilone B hydroxylase of Figure 4 is defined by a set of structure coordinates as set 
forth in Appendix 1. The term "structure coordinates" refers to Cartesian coordinates 
generated from the building of a homology model. As will be understood by those of 
skill in the art, however, a set of structure coordinates for a protein is a relative set of 

15 points that define a shape in three dimensions. Thus, it is possible that an entirely 
different set of coordinates could define a similar or identical shape. Moreover, slight 
variations in the individual coordinates, as emanate from generation of similar 
homology models using different alignment templates and/or using different methods 
in generating the homology model, will have minor effects on the overall shape. 

20 Variations in coordinates may also be generated because of mathematical 

manipulations of the structure coordinates. For example, the structure coordinates set 
forth in Appendix 1 could be manipulated by fractionalization of the structure 
coordinates; integer additions or subtractions to sets of the structure coordinates, 
inversion of the structure coordinates or any combination of the above. 

25 Various computational analyses are therefore necessary to determine whether 

a molecule or a portion thereof is sufficiently similar to all or parts of epothilone B 
hydroxylase described above as to be considered the same. Such analyses may be 
carried out in current software applications, such as SYBYL version 6.7 or 
ENSIGHTII (Molecular Simulations Inc., San Diego, CA) version 2000 and as 

30 described in the accompanying User's Guides. 

For example, the superimposition tool in the program SYBYL allows 
comparisons to be made between different structures and different conformations of 
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the same structure. The procedure used in SYBYL to compare structures is divided 
into four steps: 1) load the structures to be compared; 2) define the atom equivalencies 
in these structures; 3) perform a fitting operation; and 4) analyze the results. Each 
structure is identified by a name. One structure is identified as the target (i.e., the 

5 fixed structure); the second structure (i.e., moving structure) is identified as the source 
structure. Since atom equivalency within SYBYL is defined by user input, for the 
purpose of this aspect of the present invention equivalent atoms are defined as protein 
backbone atoms (N, Cot, C and O) for all conserved residues between the two 
structures being compared. Further, only rigid fitting operations are considered . 

10 When a rigid fitting method is used, the working structure is translated and rotated to 
obtain an optimum fit with the target structure. The fitting operation uses an algorithm 
that computes the optimum translation and rotation to be applied to the moving 
structure, such that the root mean square difference of the fit over the specified pairs 
of equivalent atoms is an absolute minimum. This number, given in angstroms, is 

15 reported by SYBYL. 

For the purposes of the present invention, any homology model of epothilone 
B hydroxylase that has a root mean square deviation of conserved residue backbone 
atoms (N, Co, C, O) of less than about 4.0 A when superimposed on the 
corresponding backbone atoms described by structure coordinates listed in Appendix 

20 1 are considered identical. More preferably, the root mean square deviation is less 
than about 3.0 A. More preferably the root mean square deviation is less than about 
2.0 A. 

For the purpose of this invention, any homology model of epothilone B 
hydroxylase that has a root mean square deviation of conserved residue backbone 
25 atoms (N, Co, C, O) of less than about 2.0 A when superimposed on the 

corresponding backbone atoms described by structure coordinates listed in Appendix 
1 are considered identical. More preferably, the root mean square deviation is less 
than about 1.0 A. 

In another embodiment of the present invention, structural models wherein 
30 backbone atoms have been substituted with other elements which when superimposed 
on the corresponding backbone atoms have low root mean square deviations are 
considered to be identical. For example, an homology model where the original 
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backbone carbon, and/or nitrogen and/or oxygen atoms are replaced with other 
elements having a root mean square deviation of about 4.0 A, more preferably about 
3.0 A, even more preferably less than about 2A, when superimposed on the 
corresponding backbone atoms described by structure coordinates listed in Appendix 

5 1 is considered identical. 

The term "root mean square deviation" means the square root of the arithmetic 
mean of the squares of the deviations from the mean. It is a way to express the 
deviation or variation from a trend or object. For purposes of this invention, the "root 
mean square deviation" defines the variation in the backbone of a protein from the 

10 relevant portion of the backbone of the epothilone B hydroxylase portion of the 
complex as defined by the structure coordinates described herein. 

The present invention as embodied by the homology model enables the 
structure-based design of additional mutants of epothilone B hydroxylase. For 
example, using the homology model of the present invention, residues lying within 

15 lOA of the binding site of epothilone B hydroxylaseihave now been defined. These 
residues include LEU39, GLN43, ALA45, MET57, LEU58, HIS62, PHE63, SBR64, 
SER65, ASP66, ARG67, GLN68, SER69, LEU74, MET75, VAL76, ALA77, 
ARG78, GLN79, ILE80, ASP84, LYS85, PR086, PHE87, ARG88, PR089, SER90, 
LEU91, ILE92, ALA93, MBT94, ASP95, HK99, ARG103, PHE110, HJE155, 

20 PHE169, GLN170, CYS172, SER173, SER174, ARG175, MET176, LEU177, 
SER178, ARG179, ARG186, PHE190, LEU193, VAL233, GLY234, LEU235, 
ALA236, PHE237, LEU238, LEU239, LEU240, ILE241, ALA242, GLY243, 
HES244, GLU245, THR246, THR247, ALA248, ASN249, MET250, LEU283, 
THR287, ILE288, ALA289, GLU290, THR291, ALA292, THR293, SER294, 

25 ARG295, PHE296, ALA297, THR298, GLU312, GLY313, VAL314, VAL315, 
GLY316, VAL344, ALA345, PHE346, GLY347, PHE348, VAL350, HIS351, 
GLN352, CYS353, LEU354, GLY355, GLN356, LEU358, ALA359, GLU362, 
LYS389, ASP391, SER392,THR393, ILE394 and TYR395 as set forth in Appendix 
1. Mutants with mutations at one or more of these positions are expected to exhibit 

30 altered biological function and/or specificity and thus comprise another embodiment 
of preferred mutants of the present invention. Another embodiment of preferred 
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mutants are molecules that have a root mean square deviation from the backbone 
atoms of said epothilone B hydroxylase of not more than about 4.0A. 

The structure coordinates of an epothilone B hydroxylase homology model or 
portions thereof are stored in a machine-readable storage medium. Such data may be 
5 used for a variety of purposes, such as drug discovery. 

Accordingly, another aspect of the present invention relates to machine- 
readable data storage medium comprising a data storage material encoded with the 
structure coordinates set forth in Appendix 1. 

The three-dimensional model structure of epothilone B hydroxylase can also 
10 be used to identify modulators of biological function and potential substrates of the 
enzyme. Various methods or combinations thereof can be used to identify such 
modulators. 

For example, a test compound can be modeled that fits spatially into a binding 
site in epothilone B hydroxylase, according to Appendix 1. Structure coordinates of 

15 amino acids within 10 A of the binding region of epothilone B hydroxylase defined by 
amino acids LBU39, GLN43, ALA45, MET57, LEU58, HIS62, PHE63, SER64, 
SER65, ASP66, ARG67, GLN68, SER69, LEU74, MET75, VAL76, ALA77, 
ARG78, GLN79, ILE80, ASP84, LYS85, PRO86, PHE87, ARG88, PR089, SER90, 
LEU91, ILE92, ALA93, MET94, ASP95, HIS99, ARG103, PHE1 10, DLE155, 

20 PHE169, GLN170, CYS172, SER173, SER174, ARG175, MET176, LEU177, 
SER178, ARG179, ARG186, PHE190, LEU193, VAL233, GLY234, LEU235, 
ALA236, PHE237, LEU238, LEU239, LEU240, ILE241, ALA242, GLY243, 
HIS244, GLU245, THR246, THR247, ALA248, ASN249, MET250, LEU283, 
THR287, ILE288, ALA289, GLU290, THR291, ALA292, THR293, SER294, 

25 ARG295, PHE296, ALA297, THR298, GLU312, GLY313, VAL314, VAL315, 
GLY316, VAL344, ALA345, PHE346, GLY347, PHE348, VAL350, HK351, 
GLN352, CYS353, LEU354, GLY355, GLN356, LEU358, ALA359, GLU362, 
LYS389, ASP391, SER392,THR393, ILE394 and TYR395, and the coordinated 
heme group, HEM1 can also be used to identify desirable structural and chemical 

30 features of such modulators. Identified structural or chemical features can then be 
employed to design or select compounds as potential epothilone B hydroxylase 
ligands. By structural and chemical features it is meant to include, but is not limited 
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to, covalent bonding, van der Waals interactions, hydrogen bonding interactions, 
charge interaction, hydrophobic bonding interaction, and dipole interaction. 
Compounds identified as potential epothilone B hydroxylase ligands can then be 
synthesized and screened in an assay characterized by binding of a test compound to 

5 epothilone B hydroxylase, or in characterizing the ability of epothilone B hydroxylase 
to modulate a protease target in the presence of a small molecule. Examples of assays 
useful in screening of potential epothilone B hydroxylase ligands include, but are not 
limited to, screening in silico, in vitro assays and high throughput assays. 

As will be understood by those of skill in the art upon this disclosure, other 

10 structure-based design methods can be used. Various computational structure-based 
design methods have been disclosed in the art. For example, a number of computer 
modeling systems are available in which the sequence of epothilone B hydroxylase 
and the epothilone B hydroxylase structure (i.e., atomic coordinates of epothilone B 
hydroxylase as provided in Appendix 1 and/or the atomic coordinates within 10A of 

15 the binding region as provided above) can be input This computer system then 
generates the structural details of one or more these regions in which a potential 
epothilone B hydroxylase modulator binds so that complementary structural details of 
the potential modulators can be determine^ Design in these modeling systems is 
generally based upon the compound being capable of physically and structurally 

20 associating with epothilone B hydroxylase. In addition, the compound must be able 
to assume a conformation that allows it to associate with epothilone B hydroxylase. 
Some modeling systems estimate the potential inhibitory or binding effect of a 
potential epothilone B hydroxylase substrate or modulator prior to actual synthesis 
and testing. 

25 Methods for screening chemical entities or fragments for their ability to 

associate with a given protein target are also well known. Often these methods begin 
by visual inspection of the binding site on the computer screen. Selected fragments or 
chemical entities are then positioned in a binding region of epothilone B hydroxylase. 
Docking is accomplished using software such as INSIGHTH, QUANTA and SYBYL, 

30 following by energy minimization and molecular dynamics with standard molecular 
mechanic force fields such as, MMFF, CHARMM and AMBER. Examples of 
computer programs which assist in the selection of chemical fragment or chemical 
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entities useful in the present invention include, but are not limited to, GRID 

(Goodford, 1985), AUTODOCK (Goodsell, 1990), and DOCK (Kuntz et al. 1982). 

Upon selection of preferred chemical entities or fragments, their relationship 

to each other and epothilone B hydroxylase can be visualized and then assembled into 

5 a single potential modulator. Programs useful in assembling the individual chemical 
i 

entities include, but are not limited to CAVEAT (Bartlett et al. 1989) and 3D 
Database systems (Martin 1992). 

Alternatively, compounds may be designed de novo using either an empty 
active site or optionally including some portion of a known inhibitor. Methods of this 

10 type of design include, but are not limited to LUDI (Bohm 1992) and LeapFrog 
(Tripos Inc., St Louis MO). 

Programs such as DOCK (Kuntz et al. 1982) can be used with the atomic 
coordinates from the homology model to identify potential ligands from databases or 
virtual databases which potentially bind the in the active site binding region which 

15 may therefore be suitable candidates for synthesis and testing. 

Also provided in the present invention are vectors comprising polynucleotides 
of the present invention and host cells which are genetically engineered with vectors 
of the present invention to produce epothilone B hydroxylase or active fragments and 
variants or mutants of this enzyme and/or ferredoxin or active fragments thereof. 

20 Generally, any vector suitable to maintain, propagate or express polynucleotides to 
produce these polypeptides in the host cell may be used for expression in this regard. 
In accordance with this aspect of the invention the vector may be, for example, a 
plasmid vector, a single- or double-stranded phage vector, or a single- or double- 
stranded RNA or DNA viral vector. Vectors may be extra-chromosomal or designed 

25 for integration into the host chromosome. Such vectors include, but are not limited to, 
chromosomal, episomal and virus-derived vectors e.g., vectors derived from bacterial 
plasmids, bacteriophages, yeast episomes, yeast chromosomal elements, and viruses 
such as baculoviruses, papova viruses, SV40, vaccinia viruses, adenoviruses, fowl 
pox viruses, pseudorabies viruses and retroviruses, and vectors derived from 

30 combinations thereof, such as those derived from plasmid and bacteriophage genetic 
elements, cosmids and phagemids. 
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Useful expression vectors for prokaryotic hosts include, but are not limited to, 
bacterial plasmids, such as those from E. coli, Bacillus or Streptomyces, including 
pBluescript, pGEX-2T, pUC vectors, pET vectors, ColEl, pCRl, pBR322, pMB9, 
pCW, pBMS200, pBMS2020, PU101, PIJ702, pANT849, pOJ260, pOJ446, 

5 pSET152, pKCl 139, pKC1218, pFD666 and their derivatives, wider host range 

plasmids, such as RP4, phage DNAs, e.g. 7 the numerous derivatives of phage lambda, 
e.g., NM989, XGT10 and A/3T11, and other phages, e.g., M13 and filamentous single 
stranded phage DN A. 

Vectors of the present invention for use in yeast will typically contain an 

10 origin of replication suitable for use in yeast and a selectable marker that is functional 
in yeast. Examples of yeast vectors useful in the present invention include, but are not 
limited to, Yeast Integrating plasmids (e.g., YIp5) and Yeast Replicating plasmids 
(the YRp and YEp series plasmids), Yeast Centromere plasmids (the YCp series 
plasmids), Yeast Artificial Chromosomes (Y ACs) which are based on yeast linear 

15 plasmids, denoted YLp, pGPD-2, 2\i plasmids and derivatives thereof, and improved 
shuttle vectors such as those described in Gietz et al. 9 Gene, 74: 527-34 (1988) 
(YIplac, YEplac and YCplac). 

Mammalian vectors useful for recombinant expression may include a viral 
origin, such as the SV40 origin (for replication in cell lines expressing the large 

20 T-antigen, such as COS1 and COS7 cells), the papillomavirus origin, or the EBV 
origin for long term episomal replication (for use, e.g., in 293-EBNA cells, which 
constitutively express the EBV EBNA-1 gene product and adenovirus El A). 
Expression in mammalian cells can be achieved using a variety of plasmids, 
including, but not limited to, pSV2, pBC12BI, and p91023, pCDNA vectors as well as 

25 lytic virus vectors (e.g., vaccinia virus, adeno virus, and baculovirus), episomal virus 
vectors (e.g., bovine papillomavirus), and retroviral vectors (e.g. t murine 
retroviruses). Useful vectors for insect cells include baculoviral vectors and pVL941. 

Selection of an appropriate promoter to direct mRNA transcription and 
construction of expression vectors are well known. In general, however, expression 

30 constructs will contain sites for transcription initiation and termination, and, in the 
transcribed region, a ribosome binding site for translation. The coding portion of the 
mature transcripts expressed by the constructs will include a translation initiating 
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codon at the beginning and a termination codon appropriately positioned at the end of 
the polypeptide to be translated. 

Examples of useful promoters for prokaryotes include, but are not limited to 
phage promoters such as phage lambda pL promoter, the trc promoter, a hybrid 

5 derived from the trp and lac promoters, the bacteriophage T7 promoter, the TAC or 
TRC system, the major operator and promoter regions of phage lambda, the control 
regions of fd coat protein, snpA promoter, melC promoter, errriE* promoter or the 
araBAD operon. Examples of useful promoters for yeast include, but are not limited 
to, the CYC1 promoter, the GAL1 promoter, the GAL10 promoter, ADH1 promoter, 

10 the promoters of the yeast a-mating system, and the GPD promoter. Examples of 
promoters routinely used in mammalian expression vectors include, but are not 
limited to, the CMV immediate early promoter, the HS V thymidine kinase promoter, 
the early and late SV40 promoters, the promoters of retroviral LTRs, such as those of 
the Rous Sarcoma Virus(RSV), and metallothionein promoters, such as the mouse 

15 metaUothionein-I promoter. 

Vectors comprising the polynucleotides can be introduced into host cells using 
any number of well known techniques including infection, transduction, transfection, 
transvection and transformation. The polynucleotides may be introduced into a host 
alone or with additional polynucleotides encoding, for example, a selectable marker 

20 or ferredoxin reductase. In a preferred embodiment of the present invention the 
polynucleotide for epothilone B hydroxylase and ferredoxin are introduced into the 
. host cell. Host cells for the various expression constructs are well known, and those 
of skill can routinely select a host cell for expressing the epothilone B hydroxylase 
and/or ferredoxin in accordance with this aspect of the present invention. Examples 

25 of mammalian expression systems useful in the present invention include, but are not 
limited to, the C127, 3T3, CHO, HeLa, human kidney 293 and BHK cell lines, and 
the COS-7 line of monkey kidney fibroblasts. 

Alternatively, as exemplified herein, epothilone B hydroxylase and ferredoxin 
can be expressed recombinantly in microorganisms. 

30 Accordingly, another aspect of the present invention relates to recombinantly 

produced microorganisms which express epothilone B hydroxylase alone or in 
conjunction with the ferredoxin and which are capable of hydroxylating a compound , 
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and in particular an epothilone, having a terminal alkyl group to produce ones having 
a terminal hydroxyalkyl group. The recombinantly produced microorganisms are 
produced by transforming cells such as bacterial cells with a plasmid comprising a 
nucleic acid sequence encoding epothilone B hydroxylase. In a preferred 
5 embodiment, the cells are transformed with a plasmid comprising a nucleic acid 
encoding epothilone B hydroxylase or mutants or variants thereof as well as the 
nucleic acid sequence encoding fenedoxin located downstream of the epothilone B 
hydroxylase gene. Examples of microorganisms which can be transformed with these 
plasmids to produce the recombinant microorganisms of the present invention 

10 include, but are not limited, Escherichia colt, Bacillus megaterium, Amycolatopsis 
orientalis, Sorangium cellulosum, Rhodococcus erythropolis, and Streptomyces 
species such as Streptomyces lividans, Streptomyces virginiae, Streptomyces 
venezuelae, Streptomyces albus, Streptomyces coelicolor, Streptomyces rimosus and 
Streptomyces griseus. 

15 The recombinantly produced microorganisms of the present invention are 

useful in microbial processes or methods for production of compounds, and in 
particular epothilones, containing a terminal hydroxyalkyl group. In general, the 
hydroxyalkyl-bearing product can be produced by culturing the recombinantly 
produced microorganism or enzyme derived therefrom, capable of selectively 

20 hydroxylating a terminal carbon or alkyl, in the presence of a suitable substrate in an 
aqueous nutrient medium containing sources of assimilable carbon and nitrogen, 
under submerged aerobic conditions. 

Suitable epothilones employed as substrate for the method of the present 
invention may be any such compound having a terminal carbon or terminal alkyl 

25 group capable of undergoing the enzymatic hydroxylation of the present invention. 
The starting material, or substrate, can be isolated from natural sources, such as 
Sorangium cellulosum y or they can be synthetically formed epothilones. Other 
substrates having a terminal carbon or terminal alkyl group capable of undergoing an 
enzymatic hydroxylation can be employed by the methods herein. For example, 

30 compactin can be used as a substrate, which upon hydroxylation forms the compound 
pravastatin. Methods for hydroxylating compactin to pravastatin via an 
Actinomadura strain are set forth in U.S. Patent 5,942,423 and U.S. Patent 6,274,360. 
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For example, using the recombinant microorganisms of the present invention 
at least one epothilone can be prepared as described in WO 00/39276, U. S. Serial. 
No. 09/468,854, filed December 21, 1999, the text of which is incorporated herein as 
if set forth at length. An epothilone of the following Formula I 
5 HO-CHr(Ai) n -(Q) m -(A2)o-E (D 

where 

Ai and A2 are independently selected from the group of optionally substituted 
C1-C3 alkyl and alkenyl; 

Q is an optionally substituted ring system containing one to three rings and at 
10 least one carbon to carbon double bond in at least one ring; 

n, m, and o are integers selected from the group consisting of zero and 1, 
where at least one ofmornorois 1; and 

E is an epothilone core; can be prepared. 
This method comprises the steps of contacting at least one epothilone of the following 
15 formula II 

CHs-CAxMCDm^VE (ID 
where Ai, Q, A2, E, n, m, and o are defined as above; 
with a recombinantly produced microorganism, or an enzyme derived 
therefrom, which is capable of selectively catalyzing the hydroxylation of formula H, 
20 and effecting said hydroxylation. 

In a preferred embodiment, the starting material is epothilone B. Epothilone B 
can be obtained from the fermentation of Sorangium celluloswn So ce90, as described 
in DE 41 38 042 and WO 93/10121. The strain has been deposited at the Deutsche 
Sammlung von Mikroorganismen (German Collection of Microorganisms) (DSM) 
25 under No. 6773. The process of fermentation is also described in Hofle, G., et aL, 

Angew. Chem. Int. Ed. Engl, Vol 35, No. 13/14, 1567-1569 (1996). Epothilone B can 
also be obtained by chemical means, such as those disclosed by Meng, D., et aL, /. 
Am. Chem. Soc, Vol. 119, No. 42, 10073-10092 (1996); Nicolaou, K., et aL, J. Am. 
Chem. Soc, Vol. 119, No. 34, 7974-7991 (1997) and Schinzer, D., et aL, Client Eur. 
30 Vol. 5, No. 9, 2483-2491 (1999). 

Growth of the recombinantly produced microorganism selected for use in the 
process may be achieved by one of ordinary skill in the art by the use of appropriate 



-28- 



WO 2004/061116 



PCT/US2003/034082 



nutrient medium. Appropriate media for the growing of the recombinantly produced 
microorganisms include those that provide nutrients necessary for the growth of 
microbial cells. See, for example, T. Nagodawithana and J. M. WasilesM, Chapter 2: 
'Media Design for Industrial Fermentations " Nutritional Requirements of 
5 Commercially Important Microorganism, edited by T. W. Nagodawithana and G. 
Reed, Esteekay Associates, Inc., Milwaukee, WI, 18-45 (1998); T. L. Miller and B. 
W. Churchill, Chapter 10: "Substrates for Large-Scale Fermentations," Manual of 
Industrial Microbiology and Biotechnology, edited by A.L. Demain and N. A. 
Solomon, American Society for Microbiology, Washington, D.C., 122-136 (1986). A 

10 typical medium for growth includes necessary carbon sources, nitrogen sources, and 
trace elements. Inducers may also be added to the medium. The term inducer as used 
herein, includes any compound enhancing formation of the desired enzymatic activity 
within the recombinantly produced microbial cell. Typical inducers as used herein 
may include solvents used to dissolve substrates, such as dimethyl sulfoxide, dimethyl 

15 formamide, dioxane, ethanol and acetone. Further, some substrates, such as 
epothilone B, may also be considered to be inducers. 

Carbon sources may include sugars such as glucose, fructose, galactose, 
maltose, sucrose, mannitol, sorbital, glycerol starch and the like; organic acids such as 
sodium acetate, sodium citrate, and the like; and alcohols such as ethanol, propanol 

20 and the like. Preferred carbon sources include, but are not limited to, glucose, 
fructose, sucrose, glycerol and starch. 

Nitrogen sources may include an N-Z amine A, corn steeped liquor, soybean 
meal, beef extract, yeast extract, tryptone, peptone, cottonseed meal, peanut meal, 
amino acids such as sodium glutamate and the like, sodium nitrate, ammonium sulfate 

25 and the like. 

Trace elements may include magnesium, manganese, calcium, cobalt, nickel, 
iron, sodium and potassium salts. Phosphates may also be added in trace or 
preferably, greater than trace amounts. 

The medium employed for the fermentation may include more than one 
30 carbon or nitrogen source or other nutrient. 

For growth of the recombinantly produced microorganisms and/or 
hydroxylation according to the method of the present invention, the pH of the medium 
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is preferably from about 5 to about 8 and the temperature is from about 14°C to about 
37°C, preferably the temperature is 28°C. The duration of the reaction is 1 to 100 
hours, preferably 8 to 72 hours. 

The medium is incubated for a period of time necessary to complete the 

5 biotransformation as monitored by high performance liquid chromatography (HPLQ. 
Typically, the period of time needed to complete the transformation is twelve to one 
hundred hours and preferably about 72 hours after the addition of the substrate. The 
medium is placed on a rotary shaker (New Brunswick Scientific Innova 5000) 
operating at 150 to 300 rpm and preferably about 250 rpm with a throw of 2 inches. 

10 The hydroxyalkyl-bearing product can be recovered from the fermentation 

broth by conventional means that are commonly used for the recovery of other known 
biologically active substances. Examples of such recovery means include, but are not 
limited to, isolation and purification by extraction with a conventional solvent, such as 
ethyl acetate and the like; by pH adjustment; by treatment with a conventional resin, 

15 for example, by treatment with an anion or cation exchange resin or a non-ionic 
adsorption resin; by treatment with a conventional adsorbent, for example, by 
distillation, by crystallization; or by recrystallization, and the like. 

The extract obtained above from the biotransformation reaction mixture can be 
further isolated and purified by column chromatography and analytical thin layer 

20 chromatography. 

The ability of a recombinantly produced microorganism of the present 
invention to biotransform an epothilone having a terminal alkyl group to an 
epothilone having a terminal hydroxyalkyl group was demonstrated. In these 
experiments, a culture comprising a Streptomyces lividans clone containing a plasmid 

25 with the ebh gene as described in more detail in Example 1 1 was incubated with an 
epothilone B suspension for 3 days at 30°with agitation. A sample of the incubate 
was extracted with an equal volume of 25% methanol: 75% n-butanol, vortexed and 
allowed to settle for 5 minutes. Two hundred |ll of the organic phase was transferred 
to an HPLC vial and analyzed by HPLC/MS (Example 12). A product peak of 

30 epothilone F eluted at a retention time of 15.9 minutes and had a protonated molecular 
weight of 524. The epothilone B substrate eluted at 19.0 minutes and had a 
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protonated molecular weight of 508. Hie peak retention times and molecular weights 
were confirmed using known standards. 

Rates of biotransformation of epothilone B by cells expressing ebh were also 
compared to rates of biotransformation by ebh mutants. Cells expressing ebh 

5 comprised a frozen spore preparation of. S. lividans (pANT849-eM). Cells 

expressing mutants comprises frozen spore preparations of S. lividans (pANT849- 
ebhl0-53) and 5. lividans (pANT849-£&ft24-16). A frozen spore preparation of S. 
lividans TK24 was used as the control. The cells were pre-incubated for several days 
at 30°C. Following this pre-incubation, epothilone B in 100% EtOH was added to 

10 each culture to a final concentration of 0.05% weight/volume- Samples were then 
taken at 0, 24, 48 and 72 hours with the exception of the S. lividans (pANT849- 
ebhlA-\6) culture, in which the epothilone B had been completely converted to 
epothilone Fat 48 hours. The samples were analyzed by HPLC. The results are 
calculated as a percentage of the epothilone B at time 0 hours. 

15 

Epothilone B: 



Time (hoars) 


TK24 


pANT849-eW» 


pANT849-«?WtlO-53 


pANT849-eM24-16 


0 


100% 


100% 


100% 


100% 


24 


99% 


78% 


69% 


56% 


48 


87% 


19% 


39% 


0% 


72 


87% 


0% 


3% 




Epothilone F: 


Time (hoars) 


TK24 


pANT849-e*A 


P ANT849-«6ftl0-53 


P ANT849-e6/i24-16 


0 


0% 


0% 


0% 


0% 


24 


0% 


4% 


9% 


23% 


48 


0% 


21% 


29% 


52% 


72 


0% 


14% 


41% 





20 The ability of cells expressing ebh to biotransform compactin to pravastatin 

was also examined. In these experiments, frozen spore preparations of S. lividans 
(pANT849) or S. lividans (pANT849-eMi) were grown for several days at 30°C. 
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Following the pre-incubation, an aliquot of each cell culture was transferred to a 
polypropylene culture tube, compactin was added to each culture tube, and the tubes 
were incubated for 24 hours, 30°C, 250 rpm. An aliquot of the culture broth was then 
extracted and compactin and pravastatin values relative to the control S. lividans 
(pANT849) culture were measured via HPLC. 

Compactin and pravastatin as a percentage of starting compactin 
concentration: 





S. lividans (pANT849) 


S. lividans (pANT849-eWi) 


Compactin 


36% 


11% 


Pravastatin 


11% 


53% 



As discussed supra, mutant ebh25A (SEQ ID NO:30) exhibits altered 
10 substrate specificity and biotransformation of epothilone B by this mutant resulted in 
a product with a different HPLC elution time than epothilone B or epothilone R A 
sample of this unknown was analyzed by LC-MS and was found to have a molecular 
weight of 523 (M.W.), consistent with a single hydroxylation of epothilone B. The 
structure of the biotransformation product was determined as 24-hydroxyl-epothilone 
15 B, based on MS and NMR data (compared with data of epothilone B): 

26 




24-hydroxyl-epothilone B 
Formula A 

Molecular Formula: C27H41NO7S 
20 Molecular Weight: 523 

Mass Spectrum: ES+ (m/z): 524([M+H] + ), 506. 

LC/MS/MS: +ESI (m/z): 524, 506, 476, 436, 320 

HRMS: Calculated for [M+H] + : 524.2682; Found: 524.2701 
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HPLC (Rt) 
LC/NMR 



10 



7.3 minutes (on the analytical HPLC system) 
Observed Chemical Shifts 
Varian AS-600 (Proton: 599.624 MHz), 
Solvent D2O/CD3CN (8 1.94): -4/6 

Proton: 87.30 (s, 1H), 6.43 (s, 1H), 5.30 (m, 1H), 4.35 (m, 1H), 
3.81 (m, 1H), 3.74 (m, 1H), 3.68 (m, 1H), 3.43 (m, 1H), 2.87 
(m, 1H), 2.66 (s, 3H), 2.40 (m, 2H), 1.58 (b, 1H), 1.48 (b, 1H), 
1.35 (m, 3H), 1.18 (s, 3H), 1.13 (s, 3H), 0.87 (m, 6H) 
♦Peaks between 1.8-2.1 ppm were not observed due to solvent 
suppression. 



The proton chemical shift was assigned as follows: 





Position 


Proton 


FaJ 










15 






m 




3 


4.35 


m 




4 








5 








6 


3.43 


m 


20 


7 


3.68 


m 




8 


1.58 


m 




9 


1.35 


b 




10 


1.48 


b 




10 


1.35 


b 


25 


11 


SSP 






12 








13 


2.87 


m 




14 


SSP 






15 


5.30 


m 


30 


16 








17 


6.43 


s 




18 







-33- 



WO 2004/061116 



PCTAJS2003/034082 



19 


7 30 


s 


90 






91 




c 
o 


99 


1 18 
l.XO 


c 


9^ 


u.o / 


m 
ill 


24 


3.81 


m 


24 


3.74 


m 


25 


0.87 


m 


26 


1.13 


s 


27 


SSP 





*SSP: no observed due to solvent suppression. 

Accordingly, the compositions and methods of the present invention are useful 
in producing known compounds that are microtubule-stabilizing agents as well as new 
compounds comprising epothilone analogs such as 24-hydroxyl-epothilone B 

15 (Formula A) and phannaceutically acceptable salts thereof expected to be useful as 
microtubule-stabilizing agents. The microtubule stabilizing agents produced using 
these compositions and methods are useful in the treatment of a variety of cancers and 
other proliferative diseases including, but not limited to, the following; 

carcinoma, including that of the bladder, breast, colon, kidney, liver, lung, 

20 ovary, pancreas, stomach, cervix, thyroid and skin; including squamous cell 
carcinoma; 

hematopoietic tumors of lymphoid lineage, including leukemia, acute 
lymphocytic leukemia, acute lymphoblastic leukemia, B-cell lymphoma, T-cell 
lymphoma, Hodgkins lymphoma, non-Hodgkins lymphoma, hairy cell lymphoma and 
25 Burketts lymphoma; 

hematopoietic tumors of myeloid lineage, including acute and chronic 
myelogenous leukemias and promyelocytic leukemia; 

tumors of mesenchymal origin, including fibrosarcoma and 
rhabdomyoscarcoma; 
30 - other tumors, including melanoma, seminoma, tetratocarcinoma, 
neuroblastoma and glioma; 



-34- 



WO 2004/061116 PCT/US2003/034082 

tumors of the central and peripheral nervous system, including astrocytoma, 
neuroblastoma, glioma, and schwannomas; 

tumors of mesenchymal origin, including fibrosarcoma, rhabdomyosarcoma, 
and osteosarcoma; and 
5 - other tumors, including melanoma, xenoderma pigmentosum, 

keratoactanthoma, seminoma, thyroid follicular cancer and teratocarcinoma. 

Microtubule stabilizing agents produced using the compositions and methods 
of the present invention will also inhibit angiogenesis, thereby affecting the growth of 
tumors and providing treatment of tumors and tumor-related disorders. Such anti- 

10 angiogenesis properties of these compounds will also be useful in the treatment of 
other conditions responsive to anti-angiogenesis agents including, but not limited to, 
certain forms of blindness related to retinal vascularization, arthritis, especially 
inflammatory arthritis, multiple sclerosis, restinosis and psoriasis. 

Microtubule stabilizing agents produced using the compositions and methods 

15 of the present invention will induce or inhibit apoptosis, a physiological cell death 
process critical for normal development and homeostasis. Alterations of apoptotic 
pathways contribute to the pathogenesis of a variety of human diseases. Compounds 
of the present invention such as those set forth in formula I and II and Formula A, as 
modulators of apoptosis, will be useful in the treatment of a variety of human diseases 

20 with aberrations in apoptosis including, but not limited to, cancer and precancerous 
lesions, immune response related diseases, viral infections, degenerative diseases of 
the musculoskeletal system and kidney disease. 

Without wishing to be bound to any mechanism or morphology, microtubule 
stabilizing agents produced using the compositions and methods of the present 

25 invention may also be used to treat conditions other than cancer or other proliferative 
diseases. Such conditions include, but are not limited to viral infections such as 
herpesvirus, poxvirus, Epstein-Barr virus, Sindbis virus and adenovirus; autoimmune 
diseases such as systemic lupus erythematosus, immune mediated glomerulonephritis, 
rheumatoid arthritis, psoriasis, inflammatory bowel diseases and autoimmune diabetes 

30 mellitus; neurodegenerative disorders such as Alzheimer's disease, ADDS-related 
dementia, Parkinson's disease, amyotrophic lateral sclerosis, retinitis pigmentosa, 
spinal muscular atrophy and cerebellar degeneration; ADDS; myelodysplastic 
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syndromes; aplastic anemia; ischemic injury associated myocardial infarctions; stroke 

and reperfusion injury; restenosis; arrhythmia; atherosclerosis; toxin-induced or 

alcohol induced liver diseases; hematological diseases such as chronic anemia and 

aplastic anemia; degenerative diseases of the musculoskeletal system such as 

5 osteoporosis and arthritis; aspirin-sensitive rhinosinusitis; cystic fibrosis; multiple 

sclerosis; kidney diseases; and cancer pain. 

The following nonlimiting examples are provided to further illustrate the 
present invention. 
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EXAMPLES 

Example 1: Reagents 
R2 Medium was prepared as follows: 
5 A solution containing sucrose (103 grams), K2SO4 (0.25 grams) MgCl 2 «6H 2 0 

(10.12 grams), glucose (10 grams), Difco Casaminoacids (0.1 grams) and distilled 
water (800 ml) was prepared. Eighty ml of this solution was then poured into a 200 
ml screw capped bottle containing 2.2 grams Difco Bacto agar. The bottle was 
capped and autoclaved. At time of use, the medium was remelted and the following 
10 autoclaved solutions were added in the order listed: 

lmlKH 2 PO 4 (0.5%) 

8mlCaCl 2 #2H 2 0(3.68%) 

1 .5 ml ^-proline (20%) 

10 ml TES buffer (5.73%, adjusted to pH 7.2) 
15 0.2 ml Trace element solution containing ZaCl 2 (40mg), FeCl 3 »6H 2 O(200 mg), 

Cua 2 *2H 2 0 (10 mg), MnCl 2 *4H 2 0 (10 mg), Na 2 B 4 O 7 *10H 2 O (10 mg), and 
(NH 4 )6Mo 7 0 24 »H 2 0 

0.5 ml NaOH (lN)(sterilization not required) 

0.5 ml Required growth factors for auxotrophs (Histidine (50 pg/ml); Cysteine 
20 (37 pg/ml); adenine, guanine, thymidine and uracil (7.5 [ig/ml); and Vitamins (0.5 
pg/ml). 

R2YE medium was prepared in the same fashion as R2 medium. However, 5 ml of 
Difco yeast extract (10%) was added to each 100 ml flask at time of use. 

25 

P (protoplast) buffer was prepared as follows: 

A basal solution made up of the following was prepared: 
Sucrose (103 grams) 
K 2 S0 4 (0.25 grams) 
30 MgCl 2 *6H 2 0 (2.02 grams) 

Trace Element Solution as described for R2 medium (2 ml) 
Distilled water to 800 ml 
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Eighty ml aliquots of the basal solution were then dispensed and autoclaved. Before 
use, the following was added to each flask in the order listed: 



T (transformation) buffer was prepared by mixing the following sterile solutions: 
25 ml Sucrose (10.3%) 
75 ml distilled water 

1 ml Trace Element Solution as described for R2 medium 



The following are then added to 9.3 mis of this solution: 
0.2mlCaCI 2 (5M) 

0.5 ml Tris maleic acid buffer prepared from 1 M solution of Tris adjusted to 
pH 8.0 by adding maleic acid. 
15 For use, 3 parts by volume of the above solution are added to 1 part by weight of PEG 
1000, previously sterilized by autoclaving. 

L (lysis) buffer was prepared by mixing the following sterile solutions: 



5 



lmlKH 2 PO 4 (0.5%) 

10 ml CaCl 2 *2H 2 0 (3.68%) 

TES buffer (5.75%, adjusted to pH 7.2) 



10 



lmlK 2 S0 4 (2.5%) 



20 



100 ml Sucrose (10.3%) 

10 ml TES buffer (5.73%, adjusted to pH 7.2) 

lmlK 2 S0 4 (2.5%) 



1 ml Trace Element Solution as described for R2 medium. 



25 



lmlKH2PO 4 (0.5%) 

0.1 ml MgCl 2 *6H 2 0 (2.5 M) 

lmlCaCl 2 (0.25M) 
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CRM Medium 

A solution containing the following components was prepared in 1 liter of 
dH 2 0: glucose (10 grains), sucrose (103 grams), MgCl 2 »6H 2 0 (10.12 grains), BBL™ 
trypticase soy broth (15 grams) (Becton Dickinson Microbiology Systems, Sparks, 
5 Maryland, USA), and BBL™ yeast extract (5 grams) (Becton Dickinson 

Microbiology Systems). The solution was autoclaved for 30 minutes. Thiostrepton 
was added to a concentration of 10 jug/ml for cultures propagated with plasmids. 

Electroporation Buffer 

10 A solution containing 30% (wt/vol) PEG 1000, 10% glycerol, and 6.5 % 

sucrose was prepared in dH 2 0. The solution was sterilized by vacuum filtration 
through a 0.22 Jim cellulose acetate filter. 

Example 2: Extraction of Chromosomal DNA from Strain SC15847 

15 Genomic DNA was isolated from an Amycolatopsis orientalis soil isolate 

strain designation SC15847 (ATCC PT-1043) using a guanidine-detergent lysis 
method, DNAzol reagent (Invitrogen, Carlsbad, California, USA). The SC15847 
culture was grown 24 hours at 28°C in F7 medium (glucose 2.2%, yeast extract 1.0%, 
malt extract 1.0 %, peptone 0.1%, pH 7.0). Twenty ml of culture was harvested by 

20 centrifugation and resuspended in 20 ml of DNAzol, mixed by pipetting and 

centrifiiged 10 minutes in the Beckman TJ6 centrifuge. Ten ml of 100% ethanol was 
added, inverted several times and stored at room temperature 3 minutes. The DNA 
was spooled on a glass pipette washed in 100% ethanol and allowed to air dry 10 
minutes. The pellet was resuspended in 500 pi of 8mM NaOH and once dissolved it 

25 was neutralized with 30 pi of 1M HBPES pH7.2. 

Example 3: PCR Reactions 

PCR reactions were prepared in a volume of 50 pi, containing 200-500 ng of 
genomic DNA or 1 .0 pi of the cDNA, a forward and reverse primer, and the forward 
30 primer being either P450-l + (SEQ ID NO:23) or P450-la + (SEQ ID NO:24) or P450- 
2 + (SEQ ID NO:25) and the reverse primer P450-3 - (SEQ ID NO:27)or P450-2(SEQ 
ID NO:26). All primers were added to a final concentration of 1.4- 2.0 pM. The PCR 
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reaction was prepared with 1 pi of Taq enzyme (2.5 units) (Stratagene), 5 |Jl of Taq 
buffer and 4 Jil of 2.5 mM of dNTPs with dH 2 0 to 50 |iL The cycling reactions were 
performed on a Geneamp® PCR system with the following protocol: 95°C for 5 
minutes, 5 cycles [95°C 30 seconds, 37°C 15 seconds (30% ramp), 72°C 30 seconds], 
5 35 cycles (94°C 30 seconds, 65°C 15 seconds, 72°C 30 seconds), 72°C 7 minutes. 
The expected sizes for the reactions are 340 bp for the P450-l + (SEQ ID NO:23) or 
P450~la + (SEQ ID NO:24) and P450-3" (SEQ ID NO:27) primer pairs, 240 bp for the 
P450-1+ (SEQ ID NO:23) and P450-2" (SEQID NO:26) primer pahs and 130 bp for 
the P450-2+ (SEQ ID NO:25) and P450-3 - (SEQ ID NO:27) primer pairs. 

10 

Example 4: Cloning of Epothilone B Hydroxylase and Ferredoxin Genes 

Twenty \ig of SC15847 genomic DNA was digested with BgUI restriction 
enzyme for 6 hours at 37°C. A 30k nanosep column (Gelman Sciences, Ann Aibor, 
Michigan, USA) was used to concentrate the DNA and remove the enzyme and 

15 buffer. The reactions were concentrated to 40 pi and washed with 200 pi of TE. The 
digestion products were then separated a 0.7% agarose gel and genomic DNA in the 
range of 12-15 kb was excised from the gel and purified using the Qiagen gel 
extraction method. The genomic DNA was then ligated to plasmid pWB19N (U.S. 
Patent 5,516,679), which had been digested with BamHI and dephosphorylated using 

20 the SAP I enzyme (Roche Molecular Biochemicals, Indianapolis, Indiana, catalog#l 
758 250). Ligation reactions were performed in a 15 pi volume with 1U of T4 DNA 
ligase (Invitrogen) for 1 hour at room temperature. One pi of the ligation was 
transformed to 100 pi of chemically competent DH10B cells (Invitrogen) and 100 pi 
plated to five LB agar plates with 30 pg/ml of neomycin, 37°C overnight. 

25 Five nylon membrane circles (Roche Molecular Biochemicals, Indianapolis, 

Indiana) were numbered and marked for orientation. The membranes were placed on 
the plates 2 minutes and then allowed to dry for 5 minutes. The membranes were then 
placed on Whatman filter disks saturated with 10% SDS for 5 minutes, 0.5N NaOH 
with 1.5 M NaCl for 5 minutes, L5 M NaCl with 1.0 M Tris pH 8.0 for 5 minutes, 

30 and 15 minutes on 2X SSC. The filters were hybridized as described previously for 
the Southern hybridization. Hybridizing colonies were picked to 2 ml of TB with 30 
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jlg/ml neomycin and grown overnight at 37°C. Plasmid DNA was isolated using a 
miniprep column procedure (Mo Bio). This plasmid was named NPB29-L 

Example 5: DNA Sequencing and Analysis 
5 The cloned PCR products were sequenced using fluorescent-dye-labeled 

terminator cycle sequencing, Big-Dye sequencing kit (Applied Biosystems, Foster 
city, California, USA) and were analyzed using laser-induced fluorescence capillary 
electrophoresis, ABI Prism 310 sequencer (Applied Biosystems). 

10 Example 6: Extraction of Total RNA 

Total RNA was isolated from the SC15847 culture using a modification of the 
Chomczynski and Sacchi method with a mono-phasic solution of phenol and 
guanidine isothiocyanate, Trizol reagent (Invitrogen). Five ml of an SC15847 frozen 
stock culture was thawed and used to inoculate 100 ml of F7 media in a 500 ml 

15 Erlenmeyer flask. The culture was grown in a shaker incubator at 230 rpm, 30°C for 
20 hours to an optical density at 600 nm (OD600) of 9.0. The culture was placed in a 
16°C shaker incubator at 230 rpm for 20 minutes. Fifty-five milligrams of epothilone 
B was dissolved in 1 ml of 100% ethanol and added to the culture. A second ml of 
ethanol was used to rinse the residual epothilone B from the tube and added to the 

20 culture. The culture was incubated at 16°C, 230 rpm for 30 hours. Thirty ml of the 
culture was transferred to a 50 ml tube, 150 mg of lysozyme was added to the culture 
and the culture was incubated 5 minutes at room temperature. Ten ml of the culture 
was placed in a 50 ml Falcon tube and centrifuged 5 minutes, 4°C in a TJ6 centrifuge. 
Two ml of chloroform was added and the tube was mixed vigorously for 15 seconds. 

25 The tube was incubated 2 minutes at room temperature and centrifuged 10 minutes, 
top speed in the TJ6 centrifuge. The aqueous layer was transferred to a fresh tube and 
2.5 ml of isopropanol was added to precipitate the RNA. The tube was incubated 10 
minutes at room temperature and centrifuged 10 minutes, 4°C. The supernatant was 
removed, the pellet was rinsed with 70% ethanol and dried briefly under vacuum- The 

30 pellet was resuspended in 150 |xl of RNase-free dH 2 0. Fifty jil of 7.5M LiCl was 
added to the RNA and incubated at -20°C for 30 minutes. The RNA was pelleted by 
centrifugation 10 minutes, 4°C in a microcentrifuge. The pellet was rinsed with 200 \d 
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of 70% ethanol, dried briefly under vacuum and resuspended in 150 pi of RNase free 
dH 2 0. 

The RNA was treated with DNasel (Ambion, Austin, Texas, USA). Twenty- 
five pi of total RNA (5.3 fig/fil), 2.5 fxl of DNasel buffer, 1.0 pi of DNasel added 
5 and incubated at 37°C for 25 minutes. Five pi of DNase I inactivation buffer added, 
incubated 2 minutes, centrifuged 1 minute, the supernatant was transferred to a fresh 
tube. 

Example 7: cDNA Synthesis 

10 cDNA was synthesized from the total RNA using the Superscript II enzyme 

(Invitrogen). The reaction was prepared with 1 pi of total RNA (5.3 fig/pl), 9 pi of 
dH 2 0, 1 pi of dNTP mix (10 mM), and 1 pi of random hexamers. The reaction was 
incubated at 65°C for 5 minutes then placed on ice. The following components were 
then added: 4 pi of 1 st strand buffer, 1 pi of RNase Inhibitor, 2.0 pi of 0.1 M DTT, 

15 and 1 pi of Superscript II enzyme. The reaction was incubated at room temperature 10 
minutes, 42°C for 50 minutes and 70°C for 15 minutes. One pi of RNaseH was added 
and incubated 20 minutes at 37°C, 15 minutes at 70°C and stored at 4°C. 

Example 8: DNA Labeling 

20 The PGR conditions used to amplify the P450 specific products from genomic 

DNA and cDNA were used to amplify the insert of plasmid pCRscript-29. Plasmid 
pCRscript-29 contains a 340bp PCR fragment amplified from SC15847 genomic 
DNA using primers P450 1 + (SEQ ID NO:23) and P450 3~ (SEQ ID NO:27). Two pi 
of the plasmid prep was used as a template, with a total of 25 cycles. The amplified 

25 product was gel purified using the Qiaquick gel extraction system (Qiagen). The 

extracted DNA was ethanol precipitated and resuspended in 5 pi of TE, the yield was 
estimated to be 500 ng. This fragment was labeled with digoxigenin using the chem 
link labeling reagent (Roche Molecular Biochemicals, Indianapolis, Indiana catalog 
#1 836 463). Five pi of the PCR product was mixed with 0.5 pi of Dig-chem link and 

30 dH 2 0 added to 20 pi. The reaction was incubated 30 minutes at 85°C and 5 pi of stop 
solution added. The probe concentration was estimated at 20 ng/pl. 
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Example 9: Southern DNA Hybridization 

Ten pi of genomic DNA (0.5 jig/pl) was digested with BamHI, Bgffl, EcoRI, 
HindlH or NotI and separated at 12 volts for 16 hours. The gel was depurinated 10 

5 minutes in 0.25 N HC1 and transferred by vacuum to a nylon membrane (Roche 
Molecular Biochemicals) in 0.4 N NaOH 5" Hg , 90 minutes using a vacuum blotter 
(Bio-Rad Laboratories, Inc. Hercules, California, USA catalog # 165-5000). The 
membrane was rinsed in 1 M ammonium acetate and UV-crosslinked using the 
Stratalinker UV Crosslinker (Stratagene). The membrane was rinsed in 2X SSC and 

10 stored at room temperature. 

The membrane was prehybridized 1 hour at 42°C in 20 ml of Dig Easy Hyb 
buffer (Roche Molecular Biochemicals). The probe was denatured 10 minutes at 65°C 
and then placed on ice. Five ml of probe in Dig-Easy Hyb at an approximate 
concentration on 20 ng/ml was incubated with the membrane at 42°C overnight The 

15 membrane was washed 2 times in 2X SCC with 0.1% SDS at room temperature, then 
2 times in 0.5X SSC with 0.1% SDS at 65°C. The membrane was equilibrated in 
Genius buffer 1 (10 mM maleic acid, 15 mM NaCl; pH 7.5; 0.3% v/v Tween 20) 
(Roche Molecular Biochemicals, Indianapolis, Indiana) for 2 minutes, then incubated 
with 2% blocking solution (2% Blocking reagent in Genius Buffer l)(Roche 

20 Molecular Biochemicals Indianapolis, Indiana) for 1 hour at room temperature. The 
membrane was incubated with a 1:20,000 dilution of anti-dig antibody in 50 ml of 
blocking solution for 30 minutes. The membrane was washed 2 times, 15 minutes 
each in 50 ml of Genius buffer L The membrane was equilibrated for two minutes in 
Genius Buffer 3 (lOmM Tris-HCl, lOmM NaCl; pH 9.5). One ml of a 1:100 dilution 

25 of CSPD (disodium 3-(4-methoxyspiro{ l,2-dioxetane-3^ , -(5 > - 

chloro)tricyclo[3.3.1.1 3,7 ]decan}-4-yl)phenyl phosphate) (Roche Molecular 
Biochemicals) in Genius buffer 3 was added to the membrane and incubated 5 
minutes at room temperature, then placed at 37°C for 15 minutes. The membrane was 
exposed to Biomax ML film (Kodak, Rochester, New York, USA) for 1 hour. 

30 

Example 10: E. cott Transformation 
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Competent cells were purchased from Invitrogen. E. coli strain DH10B was 
used as a host for genomic cloning. The chemically competent cells were thawed on 
ice and 100 |il aliquoted to a 17 x 100-mm polypropylene tube on ice. One pi of the 
ligation mixture was added to the cells and incubated on ice for 30 minutes. Hie cells 
5 were incubated at 42°G for 45 seconds, then placed on ice 1-2 minutes. 0.9 ml pf 
SOC. medium(Invitrogen) was added and the cells were incubated one hour at 30- 
37°C at 200-240 rpm. Cells were plated on a selective medium (Luria agar with 
neomycin or ampicillin at a concentration of 30 \ig /ml or 100 |xg /ml respectively). 

10 Example 11: Transformation of Streptomyces Uvidans TK24 

Plasmid pWB19N849 was constructed by digesting plasmid pWB19N with 
HindHI and treating with SAP I and digesting plasmid pANT849 (Keiser, et al., 2000, 
Practical Streptomyces Genetics, John Innes ) with HindHI. The two linearized 
fragments were ligated 1 hour at room temperature with 1U of T4 DNA ligase. One \sl 

15 of the ligation reaction was used to transform XL-1 Blue electrocompetent cells 

(Stratagene). The recovered cells were plated to LB neomycin (30 pg/ml) overnight at 
37°C Colonies were picked to 2 ml of LB with 30 fig/ml neomycin and incubated 
overnight at 30°C. MoBio plasmid minipreps were performed on all cultures. 
Plasmids constructed from the ligation of pWB 19N and pANT849 were determined 

20 by electrophoretic mobility on 0.7 % agarose. The plasmid pWB19N849 was digested 
with Hindm and Bglll to excise a 5.3 kb fragment equivalent to plasmid pANT849 
digested with BglH and Hindffl. This 5.3 kb fragment was purified on an agarose gel 
and extracted using the Qiaquick gel extraction system. 

A 1.469 kb DNA fragment containing the epothilone B hydroxylase gene and 

25 the downstream ferredoxin gene was amplified using PCR. The 50 pi PCR reaction 
was composed of 5 pi of Taq buffer, 2.5 \sl glycerol, 1 of 20 ng/jjl NPB29-1 
plasmid, 0.4 pi of 25 mM dNTPs, 1.0 pi each of primers NPB29-6F (SEQ ID NO:28) 
and NPB29-7R (SEQ ID NO:29) (5 pmole/pl), 38.1 \il of dH 2 0 and 0.5 pi of Taq 
enzyme (Stratagene). The reactions were performed on a Perkin Elmer 9700, 95°C for 

30 5 minutes, then 30 cycles (96°C for 30 seconds, 60°C 30 seconds, 72°C for 2 
minutes), and 72°C for 7 minutes. The PCR product was purified using a Qiagen 
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minielute column with the PCR cleanup procedure. The purified product was digested 
with BgUI and HindUT and purified on a 0.7 % agarose gel. A 1.469 kb band was 
excised from the gel and eluted using a Qiagen minielute column. Five \il of this PCR 
product was ligated with 2 pi of the BgUI, Hindm digested pANT849 vector in a 10 
5 pi ligation reaction. The reaction was incubated at room temperature for 24 hours and 
then transformed to & lividans TK24 protoplasts. 

Twenty ml of YEME media was inoculated with a frozen spore suspension of 
S. lividans TK24 and grown 48 hours in a 125 ml bi-indent flask. Protoplasts were 
prepared as described in Practical Streptomyces Genetics. The ligation reaction was 

10 mixed with protoplasts, then 500 pi of transformation buffer was added, followed 
immediately by 5 ml of P buffer. The transformation reactions were spun down 7 
minutes at 2,750 rpm, resuspended in 100 jul of P buffer and plated to one R2YE 
plate. The plate was incubated at 28°C for 20 hours then overlaid with 5 ml of LB 
0.7% agar with 250 Jig/ml thiostrepton. After 7 days colonies were picked to an 

15 R2YE grid plate with 50 fig/ml of thiostrepton. The colonies were grown an 
additional 5 days at 28°C, then stored at 4°C. 

This recombinant microorganism has been deposited with the ATCC and 
designated PTA-4022. 

20 Example 12: Transformation of Streptomyces rimosus 
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The procedure of Pigac and Schrempf Appl. Environ Microb., Vol 61, No. 1, 
352-356 (1995) was used to transform S. rimosus. S. rimosus strain R6 593 was 
cultivated in 20 ml of CRM medium at 30 °C on a rotary shaker (250 rpm). The cells 
were harvested at 24 hrs by centrifugation for 5 minutes, 5,000 rpm, 4 °C, and 

5 resuspended in 20 ml of 10% sucrose, 4 °C, and centrifuged for 5 minutes, 5,000 rpm, 
4 °C. The pellet was resuspended inl0mlofl5% glycerol, 4 °C and centrifuged for 5 
minutes, 5,000 rpm, 4 °C. The pellet was resuspended in 2 ml of 15% glycerol, 4 °C 
with 100 jxg/ml lysozyme and incubated at 37 °C for 30 minutes, centrifuged for 5 
minutes, 5,000 rpm, 4°C and resuspended in 2 ml of 15% glycerol, 4°C The 15% 

10 glycerol wash was repeated once and the pellet was resuspended in 1 to 2 ml of 
Electroporation Buffer. The cells were stored at -80°C in 50 - 200 |xl aliquots. 

The ligations were prepared as described for the S. lividcms transformation. 
After the incubation of the ligation reaction, the volume was brought to 100 fxl with 
dEbO, NaCl was added to 0.3M, and the reaction extracted with an equal volume of 

15 24: 1 : 1 phenolxhoroform isoamyl alcohol. Twenty |ig of glycogen was added and the 
ligated DNA was precipitated with 2 volumes of 100% ethanol at -20 °C for 30 
minutes. The DNA was pelleted 10 minutes in a microcentrifuge, washed once with 
70% ethanol, dried 5 minutes in a speed-vac concentrator and resuspended in 5 |jl of 
dH 2 0. 

20 One frozen aliquot of cells was thawed at room temperature and divided, 50 

|xl/ tube for each DNA sample for electroporation. The cells were stored on ice until 
use. DNA in 1 to 2 pi of dH^O was added and mixed. The cell and DNA mixture was 
transferred to a 2 mm gapped electrocuvette (Bio-Rad Laboratories, Richmond 
California USA) that was pre-chilled on ice. The cells were electroporated at a setting 

25 of 2 kV (lOkV/cm), 25|jF, 400 fit using a Gene Pulser™ (Bio-Rad Laboratories). The 
cells were diluted with 0.75 to 1.0 ml of CRM (0-4 °C), transferred to 15 ml culture 
tubes and incubated with agitation 3 hrs at 30 °C. The cells were plated on trypticase 
soy broth agar plates with 10-30 fig/ml of thiostrepton. 

30 Example 13: High Performance liquid chromatography 

The liquid chromatography separation was performed using a Waters 2690 
Separation Module system (Waters Corp., Milford, MA, USA) and a column, 4.6 x 
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150 nun, filled with SymmetryShield RP 8 , particle size 3,5 Jim (Waters Corp., 
Milford, MA, USA). The gradient mobile phase programming was used with a flow 
rate of 1.0 ml/minute. Eluent A was water/acetonitrile (20: 1) + 10 mM ammonium 
acetate. Eluent B was acetonitrile/water (20:1). The mobile phase was a linear 
5 gradient from 12% B to 28 % B over 6 minutes and held isocratic at 28% B over 4 
minutes. This was followed by a 28% B to 100% B linear gradient over 20 minutes 
and a linear gradient to 12% B over two minutes with a 3 minute hold at 12% B. 

Example 14: Mass spectrometry 

10 The column effluent was introduced directly into the electrospray ion source of a 

ZMD mass spectrometer (Micromass, Manchester, UK). The instrument was calibrated 
using Test Juice reference standard (Waters Corp, Milford, MA, USA) and was delivered 
at a flow of 10 [il/minute from a syringe pump (Harvard Apparatus, Holliston, MA, 
USA). The mass spectrometer was operated at a low mass resolution of 13.2 and a high 

15 mass resolution of 1 1 .2. Spectra were acquired from using a scan range of m/z 100 to 
600 at an acquisition rate of 10 spectra /second. The ionization technique employed was 
positive electrospray (ES). The sprayer voltage was kept at 2900 V and the cone voltage 
of the ion source was kept at a potential of 17 V. 

20 Example 15: Use of the ebh gene sequence (SEQ ID NO:l) to isolate cytochrome 
P450 genes from other microorganisms 

Genomic DNA was isolated from a set of cultures (ATCC43491, 
ATCC14930, ATCC53630, ATCC53550, ATCC39444, ATCC43333, ATCC35165) 
using the DNAzol reagent. The DNA was used as a template for PCR reactions using 

25 primers designed to the sequence of the ebh gene. Three sets of primers were used for 
amplification; NPB29-6f (SEQ ID NO:28) and NPB29-7r (SEQ ID NO:29), NPB29- 
16f (SEQ ID NO:50) and NPB29-17r (SEQ ID NO:51), and NPB29-19f (SEQ ID 
NO:52) and NPB29-20r (SEQ ID NO:53). 

PCR reactions were prepared in a volume of 20 pi, containing 200-500 ng of 

30 genomic DNA and a forward and reverse primer. All primers were added to a final 
concentration of 1.4- 2.0 pM. The PCR reaction was prepared with 0.2 pi of 
Advantage™ 2 Taq enzyme (BD Biosciences Clontech, Palo Alto, California, USA), 
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2 pi of Advantage™ 2 Taq buffer and 0.2 pi of 2.5 mM of dNTPs with dH 2 0 to 20 
pi. The cycling reactions were performed on a Geneamp® 9700 PCR system or a 
Mastercycler® gradient (Eppendorf, Westbury, New York, USA) with the following 
protocol: 95°C for 5 minutes, 35 cycles (96°C 20 seconds, 54-69°C 30 seconds, 72°C 

5 2 minutes), 72°C for 7 minutes. The expected size of the PCR products is 

approximately 1469 bp for the NPB29-6f (SEQ ID NO:28) and NPB29-7r (SEQ ID 
NO:29) primer pair, 1034 bp for the NPB29-16f (SEQ ID NO:50) and NPB29-17r 
(SEQ ID NO:51) primer pair and 13 18 bp for the NPB29-19f (SEQ ID NO:52) and 
NPB29-20r (SEQ ID NO:53) primer pair. The PCR reactions were analyzed on 0.7% 

10 agarose gels . PCR products of the expected size were excised from the gel and 
purified using the Qiagen gel extraction method. The purified products were 
sequenced using the Big-Dye sequencing kit and analyzed using an ABB 10 
sequencer. 
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Example 16: Construction of plasmid pPCRscript-eM 

A 1.469 kb DNA fragment containing the epothilone B hydroxylase gene and 
the downstream ferredoxin gene was amplified using PCR. The 50 jLtl PCR reaction 
was composed of 5 pi of Taq buffer, 2.5 pi glycerol, 1 |il of 20 ng/pl NPB29-1 
5 plasmid, 0.4 |Jl of 25 mM dNTPs, 1.0 pi each of primers NPB29-6f (SEQ ID NO:28) 
and NPB29-7r (SEQ ID NO:29) (5 pmole/pl), 38.1 pi of dH 2 0 and 0.5 pi of Taq 
enzyme (Stratagene). The reactions were performed on a Geneamp® 9700 PCR 
system, with the following conditions; 95°C for 5 minutes, then 30 cycles (96°C for 
30 seconds, 60°C 30 seconds, 72°C for 2 minutes), and 72°C for 7 minutes. The PCR 
10 product was purified using a Qiagen Qiaquick column with the PCR cleanup 

procedure. The purified product was digested with Bglll and HindTTT and purified on a 
0.7 % agarose gel. A 1.469 kb band was excised from the gel and eluted using a 
Qiagen Qiaquick gel extraction procedure. The fragments were then cloned into the 
pPCRscript Amp vector using the PCRscript Amp cloning kit Colonies containing 
15 inserts were picked to 1-2 ml of LB (Luria Broth) with 100 pg/ml ampicillin, 30- 
37°C, 16-24 hours, 230-300 rpm. Plasmid isolation was performed using the Mo Bio 
miniplasmid prep kit. The sequence of the insert was confirmed by cycle sequencing 
with the Big-Dye sequencing kit This plasmid was named pPCRscript-eWi. 

Example 17: Mutagenesis of the ebh gene for improved yield or altered 
specificity 

The Quikchange® XL Site-Directed Mutagenesis Kit and the Quikchange® 
Multi Site-Directed Mutagenesis kit, both from Stratagene were used to introduce 
mutations in the coding region of the ebh gene. Both of these methods employ DNA 
primers 35-45 bases in length containing the desired mutation (SEQ ID NO:54-59 and 
71), a methylated circular plasmid template and PfuTurbo® DNA Polymerase (U.S. 
Patent Nos 5,545,552 and 5,866,395 and 5,948,663) to generate copies of the plasmid 
template incorporating the mutation carried on the mutagenic primers. Subsequent 
digestion of the reaction with the restriction endonuclease enzyme Dpnl, selectively 
digests the methylated plasmid template, but leaves the non-methylated mutated 
plasmid intact. The manufacturer's instructions were followed for all procedures with 
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the exception of the Dpnl digestion step in which the incubation time was increased 
from 1 hx to 3 hrs. The pPCRscript-e&ft vector was used as the template for 
mutagenesis. 

One to two |xl of the reaction was transformed to either XLl-Blue® 

5 electiocompetent or XLIO-Gold® ultracompetent cells (Stratagene). Cells were plated 
to a density of greater than 100 colonies per plate on LA (Luria Agar) 100 |ig/ml 
ampicillin plates, and incubated 24-48 hrs at 30-37°C The entire plate was 
resuspended in 5 ml of LB containing 100 fig/ml ampicillin. Plasmid was isolated 
directly from the resuspended cells by centrifuging the cells and then purifying the 

10 plasmid using the Mo Bio miniprep procedure. This plasmid was then used as a 

template for PCR with primers NPB29-6f (SEQ ID NO:28)and NPB29-7r (SEQ ID 
NO:29) to amplify a mutated expression cassette. Digestion of the 1.469 kb PCR 
product with the restriction enzymes BgUI and HindUT was used to prepare this 
fragment for ligation to vector pANT849 also digested with BgLH and HindTTT. 

15 Alternatively, the resuspended cells were used to inoculate 20- 50 ml of LB 

containing 100 (Xg/ml ampicillin and grown 18-24 hrs at 30-37°C. Qiagen midi-preps 
were performed on the cultures to isolate plasmid DNA containing the desired 
mutation. Digestion with the restriction enzymes Bgin and Hindll was used to excise 
the mutated expression cassette for ligation to BgUI and Hindlff digested plasmid 

20 pANT849. Screening of mutants was performed in S. lividans or S. rimosus as 
described. 

Alternatively, the method of Leung et al„ Technique-A Journal of Methods in 
Cell and Molecular Biology. Vol. 1, No. 1, 11-15 (1989) was used to generate random 
mutation libraries of the ebh gene. Manganese and/or reduced dATP concentration is 

25 used to control the mutagenesis frequency of the Taq polymerase. The plasmid 

pCRscript-eM was digested with NotI to linearize the plasmid. The Polymerase buffer 
was prepared with 0.166 M (NH4) 2 S0 4 , 0.67M Tris-HCl pH 8.8, 61 mM MgCl 2 , 67 
pM EDTA pH8.0, 1.7 mg/ml Bovine Serum Albumin). The PCR reaction was 
prepared with 10 jjl of Not I digested pCRscript-eWi (O.lng/pl), 10 pi of polymerase 

30 buffer, 1.0 pi of 1M P-mercaptoethanol, 10.0 pi of DMSO, 1.0 pi of NPB29-6f (SEQ 
ID NO:28) primer (100 pmole/pl), 1.0 pi of NPB29-7r (SEQ ID NO:29) primer (100 
pmole/pl), 10 Ml of 5 mM MnCl 2 , 10.0 pi 10 mM dGTP, 10.0 pi 2 mM dATP, 10 mM 
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dTTP, 10.0 |Jl 10 mM dCTP, and 2.0 pi Taq polymerase. dH 2 0 was added to 100 pi. 
Reactions were also prepared as described above but without MnCk The cycling 
reactions were performed a GeneAmp® PGR system with the following protocol: 
95°C for 1 minute, 25-30 cycles (94 °C for 1 minute, 55 °C for 30 seconds, 72 °C for 4 
5 minutes), 72 °C for 7 minutes. The PCR reactions were separated on an agarose gel 
using a Qiagen spin column. The fragments were then digested with BgLH and HindTTT 
and purified using a Qiagen spin column. The purified fragments were then ligated to 
Bglll and HindHI digested pANT849 plasmids. Screening of mutants was performed 
in S. lividans and S, rimosus. 

10 



Table of Characterized Mutants 



Mutant 


Position 


Substitution 


Wild-type 


ebh3A-16 


92 


Valine 


Isoleucine 




237 


Alanine 


Phenylalanine 


ebh25-l 


195 


Serine 


Asparagine 




294 


Proline 


Serine 


ebhlO-53 


190 


Tyrosine 


Phenylalanine 




231 


Axginine 


Glutamic acid 


eWi24-16d8 


92 


Valine 


Isoleucine 




237 


Alanine 


Phenylalanine 




67 


Glutamine 


Axginine 


6^24-16011 


92 


Valine 


Isoleucine 




93 


Glycine 


Alanine 




237 


Alanine 


Phenylalanine 




365 


Threonine 


Isoleucine 


eWt24-16-16 


92 


Valine 


Isoleucine 




106 


Alanine 


Valine 




237 


Alanine 


Phenylalanine 


eWi24-16-74 


88 


Histidine 


Axginine 




92 


Valine 


Isoleucine 




237 


Alanine 


Phenylalanine 


ebh-UlS 


31 


Lysine 


Glutamic acid 




176 


Valine 


Methionine 


eWt24-16g8 


92 


Valine 


Isoleucine 
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e&/i24-16b9 



237 
67 
130 
176 

92 

237 

67 

140 

176 



Alanine 
Glutamine 
Threonine 
Alanine 

Valine 

Alanine 

Glutamine 

Threonine 

Serine 



Phenylalanine 
Arginine 
Isoleucine 
Methionine 

Isoleucine 

Phenylalanine 

Arginine 

Alanine 

Methionine 



Example 18: Comparison of epothilone B transformation in cells expressing ebh 
and mutants thereof 

5 In these experiments, twenty ml of YEME medium in a 125 ml bi-indented 

flask was inoculated with 200 pi of a frozen spore preparation of S. lividans TK24, S. 
lividans (pANT849-*M), S. lividans (pANT849-eWilO-53) or S. lividans (pANT849- 
efc/i24-16) and incubated 48 hours at 230 rpm, 30°C Thiostrepton, 10 jug/ml was 
added to media inoculated with S. lividans (pANT849-e&/i), S. lividans (pANT849- 

10 e&ftl0-53) and S. lividans (pANT849-eZ>fc24-16). Four ml of culture was transferred to 
20 ml of R5 medium in a 125 ml Erlenmeyer flask and incubated 18 hrs at 230 rpm, 
30°C Epothilone B in 100% EtOH was added to each culture to a final concentration 
of 0.05% weight/volume. Samples were taken at 0, 24, 48 and 72 hours with the 
exception of the S. lividans (pANT849-eM24-16) culture, in which the epothilone B 

15 had been completely converted to epothilone F at 48 hours. The samples were 
analyzed by HPLC. Results were calculated as a percentage of the epothilone B at 
time 0 hours. 
Epothilone B: 



Time (hours) 


TK24 


P ANT849-«W« 


pANT849-cW»10-53 


pANT849-c6/!24-16 


0 


100% 


100% 


100% 


100% 


24 


99% 


78% 


69% 


56% 


48 


87% 


19% 


39% 


0% 


72 


87% 


0% 


3% 




Epothilone F: 


Time (hours) 


TK24 


pANK49-ebh 


pANT849-66A10-53 


pANT849^/i24-16 
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0 0% 0% 0% 0% 

24 0% 4% 9% 23% 

48 0% 21% 29% 52% 

72 0% 14% 41% — 



Alternatively, the bioconversion of epothilone B to epothilone F was 
performed in S. rimosus host cells transformed with expression plasmids containing 
the ebh gene and its variants or mutants. One-hundred pi of a frozen S. rimosus 

5 transformant culture was inoculated to 20 ml CRM media with 10 ng/ml thiostrepton 
and cultivated 16-24 hr, 30°C, 230- 300 rpm. Epothilone B in 100% ethanol was 
added to each culture to a final concentration of 0.05% weight/volume. The reaction 
was typically incubated 20- 40hrs at 30 °C, 230-300 rpm. The concentration of 
epothilones B and F was determined by HPLC analysis. 

10 Evaluation of mutants in S. rimosus 



Mutant 


Epothilone F yield 


ebh-MIS 


55% 


ebhlA-\(A% 


75% 


ebh2A-l6cll 


75% 


ebh24-16-16 


75% 


eWt24-16-74 


75% 


ebK2A-\6b9 


80% 


ebh2A-16gS 


85% 



Example 19: Biotransformation of compactin to pravastatin 

Twenty ml of R2YE media with 10 (ig/ml thiostrepton in a 125 ml flask was 
inoculated with 200 pi of a frozen spore preparation of 5. lividans (pANT849), S. 

15 lividans (pANT849-eWi) and incubated 72 hours at 230 rpm, 28°C. Four ml of culture 
was inoculated to 20 ml of R2YE media and grown 24 hours at 230 rpm, 28°C. One 
ml of culture was transferred to a 15 ml polypropylene culture tube, 10 pi of 
compactin (40 mg/ml) was added to each culture and incubated for 24 hours, 28°C, 
250 rpm. Five hundred pi of the culture broth was transferred to a fresh 15 ml 

20 polypropylene culture tube. Five hundred pi of 50 mM sodium hydroxide was added 
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and vortexed. Three ml of methanol was added and vortexed, the tube was 
centrifuged 10 minutes at 3000 rpm in a TJ-6 table-top centrifuge. The organic phase 
was analyzed by HPLC. Compactin and pravastatin values were assessed relative to 
the control S. lividans (pANT849) culture. 

5 

Compactin and Pravastatin as a Percentage of Starting Compactin 
Concentration: 

— — s. lividans (pAm&49) S. lividans (pANT849^6A) 

Compactin 36% 11% — 

Pravastatin 1 1 % 53% 



Example 20: High performance liquid chromatography method for compactin and 

10 pravastatin detection 

The liquid chromatography separation was performed using a Hewlett 
Packardl090 Series Separation system (Agilent Technologies, Palo Alto, California, 
USA) and a column, 50x46 mm, filled with Spherisorb ODS2, particle size 5 jxm 
(Keystone Scientific, Ihc, Bellefonte, Pennsylvania, USA). The gradient mobile 

15 phase programming was used with a flow rate of 2.0 ml/minute. Eluent A was water, 
10 mM ammonium acetate and 0.05% Phosphoric Acid. Eluent B was acetonitrile. 
The mobile phase was a linear gradient from 20% B to 90 % B over 4 minutes. 

Example 21: Structure determination of the biotransformation product of 
20 mutant ebh25-l 

Analytical HPLC was performed using a Hewlett Packard 1100 Series Liquid 
Chromatograph with a YMC Packed ODS-AQ column, 4.6 mm i.d. x 15 cm 1. A 
gradient system of water (solvent A) and acetonitrile (solvent B) was used: 20% to 
90% B linear gradient, 10 minutes; 90% to 20% linear gradient, 2 minutes. The flow 
25 rate was 1 ml/minute and UV detection was at 254 nm. 

Preparative HPLC was performed using the following equipment and 
conditions: 

Pump: Varian ProStar Solvent Delivery Module (Varian Inc., Palo Alto, California, 
USA). Detector: GynkotekUVD340S. 
30 Column: YMC ODS-A column (30mmID X 100 mm length, 5[i particle size). 
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Elution flow rate: 30 ml/minute 

Ehition gradient: (solvent A: water; solvent B: acetonitrile), 20% B f 2 minutes; 20% 
to 60% B linear gradient, 18 minutes; 60% B, 2 minutes; 60 % to 90% B linear 
gradient, 1 minute; 90 % B, 3 minutes; 90 % to 20% B linear gradient, 2 minutes, 
5 Detection: UV, 210 nm. 

LC/NMR was performed as follows: 40 pi of sample was injected onto a 
YMC Packed ODS-AQ column (4.6 mm i.d. x 15 cm 1). The column was eluted at 1 
ml/minute flow rate with a gradient system of D 2 0 (solvent A) and acetonitrile-d 3 
(solvent B): 30% B, 1 minute; 30% to 80% B linear gradient, 11 minutes. The eluent 

10 passed a UV detection cell (monitored at 254 nm) before flowing through a F19/H1 
NMR probe (60 \& active volume) in Varian AS-600 NMR spectrometer. The 
biotransformation product was eluted at around 7.5 minutes and the flow was stopped 
manually to allow the eluent to remain in the NMR probe for NMR data acquisition. 
Isolation and analysis was performed as follows. The butanol/methanol extract 

15 (about 10 ml) was evaporated to dryness under nitrogen stream. One ml methanol 
was added to the residue (38 mg) and insoluble material was removed by 
centrifugation (13000 rpm, 2 min). 0.1 ml of the supernatant was used for LC/NMR 
study and the rest of 0.9 ml was subjected to the preparative HPLC (0.2-0.4 ml per 
injection). Two major peaks were observed and collected: peak A was eluted between 

20 14 and 15 minutes, while peak B was eluted between 16.5 and 17.5 minutes. 

Analytical HPLC analysis indicated that peak B was the parent compound, epothilone 
B (Rt 8.5 minutes), and peak A was the biotransformation product (Rt 7.3 minutes). 
The peak A fractions were pooled and MS analysis data was obtained with the pooled 
fractions. The pooled fraction was evaporated to a small volume, then was 

25 lyophilized to give 3 mg of white solid. NMR and HPLC analysis of the white solid 
(dissolved in methanol) revealed that the biotransformation product was partially 
decomposed during the drying process. 
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APPENDIX 1 



Atom No. 


Residue 


Atom Name 


X-coord 


Y-coord 


Z-coord 


1 


ALA9 


N 


-39.918 


-4.913 


-1.651 


2 


ALA9 


CA 


-38.454 


-5.033 


-1.537 


3 


ALA9 


C 


-37.953 


-4.886 


-0.099 


4 


ALA9 


0 


-38.625 


-4.31 


0.765 


5 


ALA9 


CB 


-37.809 


-3.967 


-2.415 


6 


THR10 


N 


-36.781 


-5.447 


0.146 


7 


THR10 


CA 


-36.187 


-5.437 


1.49 


8 


THR10 


C 


-34.916 


-4.585 


1.553 


9 


THR10 


0 


-34.016 


-4.735 


0.72 


10 


THR10 


CB 


-35.871 


-6.887 


1.846 


11 


THR10 


OG1 


-37.075 


-7.631 


1.717 


12 


THR10 


CG2 


-35.355 


-7.053 


3.271 


13 


LEU11 


N 


-34.858 


-3.699 


2.536 


14 


LEU11 


CA 


-33.669 


-2.853 


2.745 


15 


LEU 11 


C 


-32.511 


-3.649 


3.353 


16 


LEU11 


O 


-32.706 


-4.468 


4.259 


17 


LEU11 


CB 


-34.033 


-1.707 


3.687 


18 


LEU11 


CG 


-35.079 


-0.78 


3.078 


19 


LEU 11 


CD1 


-35.53 


0.265 


4.091 


20 


LEU11 


CD2 


-34.555 


-0.111 


1.81 


21 


PR012 


N 


-31.32 


-3.422 


2.823 


22 


PR012 


CA 


-30.121 


-4.119 


3.302 


23 


PR012 


C 


-29.652 


-3.606 


4.663 


24 


PR012 


O 


-29.656 


-2.397 


4.918 


25 


PR012 


CB 


-29.081 


-3.842 


2.259 


26 


PR012 


CG 


-29.597 


-2.771 


1.309 


27 


PR012 


CD 


-31.031 


-2.493 


1.729 


28 


LEU13 


N 


-29.278 


-4.522 


5.54 


29 


LEU13 


CA 


-28.676 


-4.118 


6.819 


30 


LEU 13 


C 


-27.183 


-3.88 


6.627 


31 


LEU13 


O 


-26.449 


-4.806 


6.267 


32 


LEU13 


CB 


-28.898 


-5.196 


7.872 


33 


LEU13 


CG 


-30.374 


-5.354 


8.217 


34 


LEU13 


CD1 


-30.587 


-6.516 


9.181 


35 


LEU13 


CD2 


-30.945 


-4.067 


8.802 


36 


ALA14 


N 


-26.72 


-2.741 


7.112 


37 


ALA14 


CA 


-25.355 


-2.266 


6.825 


38 


ALA14 


C 


-24.244 


-2.941 


7.634 


39 


ALA14 


O 


-23.058 


-2.719 


7.372 


40 


ALA14 


CB 


-25.311 


-0.764 


7.075 


41 


ARG15 


N 


-24.628 


-3.792 


8.569 


42 


ARG15 


CA 


-23.664 


-4.537 


9.379 


43 


ARG15 


C 


-23.478 


-5.983 


8.91 


44 


ARG15 


O 


-22.815 


-6.767 


9.599 


45 


ARG15 


CB 


-24.174 


-4.519 


10.81 


46 


ARG15 


CG 


-25.655 


-4.879 


10.84 


47 


ARG15 


CD 


-26.2 


-4.843 


12.26 


48 


ARG15 


NE 


-27.657 


-5.039 


12.256 


49 


ARG15 


cz 


-28.358 


-5.301 


13.36 


50 _ 


ARG15 


NH1 


-29.69 


-5.376 


13.3 


51 


ARG15 


NH2 


-27.735 


-5.412 


14.536 


52 


LYS16 


N 


-24.096 


-6.351 


7.798 
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53 


LYS16 


CA 


-24.016 


-7.741 


7.335 


54 


LYS16 


C 


-22.639 


-8.128 


6.807 


55 


LYS16 


0 


-21.959 


-7.359 


6.115 


56 


LYS16 


CB 


-25.061 


-7.977 


6.252 


57 


LYS16 


CG 


-26.466 


-7.985 


6.839 


58 


LYS16 


CD 


-26.605 


-9.079 


7.892 


59 


LYS16 


CE 


-28.002 


-9.092 


8.499 


60 


LYS16 


NZ 


-28.113 


-10.128 


9.537 


61 


CYS17 


N 


-22.317 


-9.392 


7.036 


62 


CYS17 


CA 


-21.061 


-10.004 


6.56 


63 


CYS17 


c 


-20.737 


-9.771 


5.066 


64 


CYS17 


0 


-19.662 


-9.205 


4.833 


65 


CYS17 


CB 


-21.096 


-11.501 


6.864 


66 


CYS17 


SG 


-21 .33 


-11.937 


8.602 


67 


PR018 


N 


-21.635 


-10.003 


4 1 


68 


PR018 


CA 


-21.293 


-9.756 


2 683 


69 


PR018 


C 


-21 123 


-8 291 


2 24.fi 


70 


PR018 


o 


-21 013 

Cm I m\J IV 


-8.061 


1 0^6 


71 


PR018 


CB 


-22.388 


-10.383 


1 878 


72 


PR018 


CG 


-23.509 


-10 812 

1 VJ.V 1 Cm 


2 802 


73 


PR018 


CD 


-23.002 


-1 0 *554 


d 9(Y7 


74 


PHE19 


N 


-21 137 

Cm I m IWI 


-7 11 


1 1fi2 


75 


PHE19 


CA 


_pn 7Q2 


-*i Q47 




76 


PHE19 


c 


-19.279 


-5.777 


2 788 


77 


PHE19 


0 


-18 789 


-4. Q2 


2 Cllft 


78 


PHE19 


CB 


-21 36 


-5 007 


1 8Q4 


79 


PHE19 


CG 


-22.8 




1 8*i4 


80 


PHE19 


CD1 


-23 051 


\J»Cm 1 


1 919 


81 


PHE19 


CD2 


-23.856 


-5.444 


3.867 


82 


PHE19 


CE1 


-24.355 


-2.853 


3.003 


83 


PHE19 


CE2 


-25.159 


-5.03 


3 629 


84 


PHE19 


CZ 


-25.409 


-3.735 


3.197 


85 


SER20 


N 


-18.573 


-6.687 


3.449 


86 


SER20 


CA 


-17.102 


-6.717 


3.446 


87 


SER20 


c 


-16.569 


-7.839 


4.342 


88 


SER20 


0 


-16.632 


-7.723 


5.573 


89 


SER20 


CB 


-16.557 


-5.371 


3.929 


90 


SER20 


OG 


-17.236 


-5.019 


5.129 


91 


PR021 


N 


-15.974 


-8.867 


3.753 


92 


PR021 


CA 


-15.978 


-9.134 


2.304 


93 


PR021 


C 


-17.267 


-9.836 


1.856 


94 


PR021 


0 


-18.026 


-10.327 


2.702 


95 


PR021 


CB 


-14.8 


-10.047 


2.111 


96 


PR021 


CG 


-14.442 


-10.669 


3.455 


97 


PR021 


CD 


-15.306 


-9.949 


4.481 


98 


PR022 


N 


-17.551 


-9.859 


0.561 


99 


PR022 


CA 


-16.897 


-9.007 


-0.445 


100 


PR022 


C 


-17.4 


-7.575 


-0.296 


101 


PR022 


0 


-18.341 


-7.371 


0.469 


102 


PR022 


CB 


-17.32 


-9.591 


-1.762 


103 


PR022 


CG 


-18.478 


-10.549 


-1.528 


104 


PR022 


CD 


-18.669 


-10.604 


-0.021 


105 


PR023 


N 


-16.687 


-6.605 


-0.842 


106 


PR023 


CA 


-17.224 


-5.241 


-0.897 
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107 


PR023 


c 


-18.525 


-5.21 


-1.693 


108 


PR023 


o 


-18.524 


-5.083 


-2.925 


109 


PR023 


CB 


-16.159 


-4.417 


-1.547 


110 


PR023 


CG 


-15.004 


-5.321 


-1.95 


111 


PR023 


CD 


-15.388 


-6.725 


-1 .509 


112 


GLU24 


N 


-19.62 


-5.122 


-0.956 


113 


GLU24 


CA 


-20.963 


-5.192 


-1.547 


114 


GLU24 


c 


-21.415 


-3.843 


-2.088 


115 


GLU24 


o 


-22.323 


-3.794 


-2.93 


116 


GLU24 


CB 


-21.934 


-5.68 


-0.48 


117 


GLU24 


CG 


-23.27 


-6.137 


-1 .052 


118 


GLU24 


CD 


-23.982 


-7.017 


-0.024 


119 


GLU24 


OE1 


-24.613 


-7.981 


-0.433 


120 


GLU24 


V/LX 


-23.833 


-6.745 


1 15ft 


121 


tyro ^ 


N 


-20 *573 


-P ft43 


-1 ft7ft 


122 


TYRP5 


CA 


-20 842 


-1 47 
i .*t/ 


-p mi 


123 


TYRP5 


c. 


-20 704 


-1 ^11 

1 -O 1 1 


"O.O 1 D 


124 


TYRP5 


o 

VJr 


-P1 *}fi4 


-O 43fi 


-4 *W> 


125 




CR 


-1 Q RPft 


-O ^fifl 


-1 fiOft 

- 1 .DUO 


126 


TYRP5 


CG 


-1Q fi1R 


_n ftftp 


-O 1Pft 

\J. 1 C.KJ 


1P7 


TYRP^ 

. 1 I Y\£—\J 


oni 


-PO fifiP 


-V/. / do 


O 77Q 


1PR 


TYRO*? 


cnp 


-1 ft **R4 


-1 PQft 


n n 1 

\J.O 1 1 


1PQ 


1 Truip 


CP1 


.on /ffi1 


-1 flftP 


P 11Q 


1 ou 


1 I luiO 




- I O. 1 DO 


- 1 .DUO 


1 .DO 


1^1 


1 i n^o 




-1Q P1<% 


-1 4QP 


p *w 


1ftp 


TYRP^ 


OH 


-1Q OPfi 


-1 fiKQ 

- 1 .OOv7 


O.ODD 


133 




M 


-PO 1 


-P PQfi 


-4 4fift 


134 


GLUP6 


OA 


-PO 00Q 


-P PQ^ 


Qpa 


135 


GLU26 


c 


-P1 404. 


-P 4A3 


-ft *»P 


136 


GL U26 


o 

w 


-21 Q2 


-1 *y7P 


-7 177 
/•if/ 


137 


GLU26 


CR 

V/U 


-1 Q 1 29 


-3 454 


-6 3Q 


138 


GLU26 


CG 


-17.813 


-3.593 


-5.628 


139 


GLU26 


CD 


-16.94 


-2.342 


-5.707 


140 


GLU26 


OE1 


-16.345 


-2.12 


-6.749 


141 


GLU26 


OE2 


-16.773 


-1.731 


-4.657 


142 


ARG27 


N 


-22.105 


-3.488 


-6.017 


143 


ARG27 


CA 


-23.437 


-3.805 


-6.538 


144 


ARG27 


c 


-24.504 


-2.909 


-5.921 


145 


ARG27 


o 


-25.496 


-2.591 


-6.59 


146 


ARG27 


CB 


-23.752 


-5.26 


-6.22 


147 


ARG27 


CG 


-22.7 


-6.189 


-6.812 


148 


ARG27 


CD 


-23.031 


-7.653 


-6.55 


149 


ARG27 


NE 


-23.146 


-7.926 


-5.108 


150 


ARG27 


CZ 


-22.251 


-8.648 


-4.428 


151 


ARG27 


NH1 


-21.16 


-9.11 


-5.043 


152 


ARG27 


NH2 


-22.428 


-8.879 


-3.126 


153 


LEU28 


N 


-24.197 


-2.331 


-4.771 


154 


LEU28 


CA 


-25.11 


-1.358 


-4.168 


155 


LEU28 


C 


-25.131 


-0.079 


-4.987 


156 


LEU28 


0 


-26.214 


0.286 


-5.45 


157 


LEU28 


CB 


-24.67 


-1.039 


-2.746 


158 


LEU28 


CG 


-24.868 


-2.224 


-1.81 


159 


LEU28 


CD1 


-24.303 


-1.916 


-0.43 


160 


LEU28 


CD2 


-26.34 


-2.609 


-1.716 
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161 


ARG29 


N 


-23.969 


0.307 


-5.49 


162 


ARG29 


CA 


-23.835 


1.502 


-6.327 


163 


ARG29 


C 


-24.521 


1.334 


-7.677 


164 


ARG29 


O 


-25.271 


2.226 


-8.096 


165 


ARG29 


CB 


-22.345 


1.682 


-6.568 


166 


ARG29 


CG 


-21.997 


2.947 


-7.336 


167 


ARG29 


CD 


-20.519 


2.941 


-7.711 


168 


ARG29 


NE 


-19.696 


2.563 


-6.551 


169 


ARG29 


CZ 


-18.945 


1.459 


-6.523 


170 


ARG29 


NH1 


-18.872 


0.673 


-7.6 
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-26.92 


15.76 


4.71 


436 


PHE63 


CD2 


-25.265 


15.901 


2.972 


437 


PHE63 


CE1 


-27.804 


15.161 


3.824 


438 


PHE63 


CE2 


-26.147 


15.298 


2.087 


439 


PHE63 


CZ 


-27.415 


14.928 


2.512 


440 


SER64 


N 


-25.159 


18.211 


8.569 


441 


SER64 


CA 


-24.502 


18.656 


9.795 


442 


SER64 


i c 


-23.765 


17.525 


10.507 


443 


SER64 


O 


-24.07 


16.34 


10.339 


444 


SER64 


CB 


-25.587 


19.225 


10.7 


445 


SER64 


OG 


-24.96 


19.785 


11.84 


446 


SER65 


N 


-22.719 


17.898 


11.218 


447 


SER65 


CA 


-22.006 


16.938 


12.053 


448 


SER65 


c 


-22.463 


17.032 


13.513 


449 


SER65 


o 


-22.031 


16.247 


14.365 


450 


SER65 


CB 


-20.522 


17.234 


11.936 


451 


SER65 


OG 


-20.167 


17.174 


10.564 


452 


ASP66 


N 


-23.368 


17.961 


13.782 


453 


ASP66 


CA 


-23.901 


18.186 


15.122 


454 


ASP66 


c 


-25.388 


18.496 


14.919 


455 


ASP66 


o 


-25.978 


18.026 


13.938 


456 


ASP66 


CB 


-23.149 


19.393 


15.69 


457 


ASP66 


CG 


-22.904 


19.311 


17.192 


458 


ASP66 


OD1 


-21.835 


19.724 


17.618 


459 


ASP66 


OD2 


-23.871 


19.048 


17.899 


460 


ARG67 


N 


-25.972 


19.246 


15.842 


461 


ARG67 


CA 


-27.32 


19.831 


15.692 


462 


ARG67 


C 


-28.423 


18.78 


15.619 


463 


ARG67 


O 


-28.768 


18.296 


14.533 


464 


ARG67 


CB 


-27.384 


20.684 


14.423 


465 


ARG67 


CG 


-26.263 


21.716 


14.336 


466 


ARG67 


CD 


-26.329 


22.778 


15.428 


467 


ARG67 


NE 


-25.137 


23.64 


15.358 


468 


ARG67 


CZ 


-25.091 


24.799 


14.695 


469 


ARG67 


NH1 


-26.189 


25.28 


14.107 


470 


ARG67 


NH2 


-23.957 


25.503 


14.663 


471 


GLN68 


N 


-28.983 


18.45 


16.768 


472 


GLN68 


CA 


-30.127 


17.538 


16.79 


473 


GLN68 


C 


-31.414 


18.348 


16.65 


474 


GLN68 


O 


-31.728 


19.187 


17.503 


475 


GLN68 


CB 


-30.116 


16.757 


18.1 


476 


GLN68 


CG 


-31.207 


15.692 


18.12 


477 


GLN68 


CD 


-31.109 


14.852 


19.389 


478 


GLN68 


OE1 


-31.941 


14.973 


20.296 


479 


GLN68 


NE2 


-30.137 


13.955 


19.406 


480 


SER69 


N 


-32.129 


18.102 


15.565 


481 


SER69 


CA 


-33.37 


18.833 


15.272 


482 


SER69 


C 


-34.444 


18.558 


16.32 


483 


SER69 


O 


-34.447 


17.495 


16.958 


484 


SER69 


CB 


-33.885 


18.387 


13.91 



-64- 



WO 2004/061116 



PCT/US2003/034082 



485 


SER69 


OG 


-34.261 


17.025 


14 fwa 

it.VOO 


486 


PRO70 


N 


-35.332 


19.526 


16 4QQ 


487 


PRO70 


CA 


-36.438 


19.385 


17 447 


488 


PRO70 


c 


-37.244 


18.122 


17 171 
■ t • 1 / 1 


489 


PRO70 


o 


-37.547 


17.795 


16 01 ft 
■ o.w 10 


490 


PRO70 


CB 


-37.267 


20.622 


17 9Q1 


491 


PRO70 


CG 


-36 6 


21 547 


1fi 9ftR 
1 0.4.00 


492 


PR070 


CD 




20 824 


m ai r 
10.0 10 


493 


SER71 


N 


-37 424 


17 369 


1ft OAK 


494 


SER71 


CA 


-3ft 115 




1ft 9ftQ 

lOucoy 


495 


SER71 


c 


-37 5ftQ 


15 09 


17 OOft 

i / .*iyo 


496 


SER71 


o 


_QQ 070 


14 99ft 


1ft 7ftO 

lo./oy 


497 




CR 


Oi7,0£.0 


1fi 944. 


1ft 111 
10.1 1 1 


498 




1/V2I 


.OQ Q1 Q 


HC COD 
1 0.000 


1ft 777 
lo./// 


499 


inc.* c 


N 


Oft OftO 


14 QftR 

J4.yoo 


1 /.Uol 


500 




nA 


OC ft7Q 

-00.0/ y 


I O.O/D 


1 C QO-f 


w 1 


inc./ ^ 


p 


_Q>< OCA 

-04.004 


1Q vlQ 


■1 e ac7 
1 o.yo7 


502 


PHF79 
r nt/c 


n 


"OO.^O 1 


1Q 7ft ft 
10./ OO 


1 d AtZQ 

1 o.4oo 


\J\J\J 


PHF79 
rnc/4 


vD 


-00.4£0 


1 4.<loo 


1 4.0 72 


504 






-OO.DO/i 


1 4.4O0 




505 


PWF70 
rnC/c 


UUl 


-o/.uy/ 


1 0. / *£4 


13.63 


50fi 


PHF79 
r nC/ 


\SlJC. 


Q7 Af\o 


I 0.009 


1 0.0 1 7 


507 


pUip-TO 

rnc/t 


1 


OQ OQQ 

-00.^00 


lO.O/O 


1 2.853 


ouo 


PWF70 
rnC/ 


otic 


-00.044 


lo.4o9 


12.84 


50Q 


PHF70 
i nc/ ^ 




-oo.yo^ 


"i A 7CQ 

14.7oo 


12.459 


510 


r nu / o 


M 


Q>l /ICQ 


-10 CO 


17.979 


51 1 


PRH7Q 


PA 


QQ Q1 


-JO HO 

\c~ ly 


1 o.7oo 


512 


PRH7Q 


V-/ 


_QO COO 


ll.U<£/ 


lo.lo 


513 


PRH7Q 


n 


-otC. DUD 


0 ooc 
y.oyo 


1 0.000 


514 


PRH7Q 


V-/D 


-QQ flQQ 

-oo.oyo 


11./ /O 




515 


PR073 
r nw / o 


Cfi 
ova 


-Q5 QQO 

-oo.oy<£ 


1 1 .000 


1Q 017 

1 y.yi 1 


516 


PRH7Q 


CD 


_QK 7AO 
-OO. / UO 


1Q CiCiA 


lo.o 


517 


l— l— w / *T 


N 

IN 


-Q.1 770 
-O I ./ 1 £. 


11 QH4 


17 107 


518 


LEU74 


CA 


-ov/.yoo 


10 OftQ 


1ft R91 
lO.O«CI 


519 


LEU74 




-9Q 707 
-&y./ 1// 


y.y / u 


17 Q7«\ 


520 


LEU74 






in ftQ9 
i u.oy^ 


17 QOft 


521 


LEU74 


CB 


-^0 4.74. 

VW.t/ *T 


10 fiQ7 


I O. 1 OO 


522 


LEU74 


CG 


-31 627 


10 7Q4 


14 14ft 

1 *T. 1 tO 


523 


LEU74 


CD1 


-31 .094 


11 1Q4 


1P 77ft 


524 


LEU74 


CD2 


-32 381 


q 471 


14 


525 


MET75 


N 


-29.359 


8 705 


17 454 

1 / .*tO*t 


526 


MET75 


CA 


-28.167 


8.306 


1ft POft 


527 


MET75 


c 


-27.099 


7.808 


17 243 


528 


MET75 


0 


-27.166 


6.675 


16 746 


529 


MET75 


CB 


-28.539 


7^08 


19 198 


530 


MET75 


CG 


-27.367 


6.867 


20 114 


531 


MET75 


SD 


-27.678 


5.549 


21 .31 


532 


MET75 


CE 


-28.002 


4.197 


20.154 


533 


VAL76 


N 


-26.117 


8.657 


16.992 


534 


VAL76 


CA 


-25.071 


8.327 


16.017 


535 


VAL76 


C 


-24.274 


7.103 


16.455 


536 


VAL76 


0 


-23.953 


6.925 


17.636 


537 


VAL76 


CB 


-24.151 


9.527 


15.809 


538 


VAL76 


CG1 


-24.904 


10.676 


15.149 
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539 


VAL76 


CG2 


-23.504 


9.986 


17.109 


540 


ALA77 


N 


-23.836 


6.34 


15.467 


541 


ALA77 


CA 


-23.158 


5.062 


15.727 


542 


ALA77 


C 


-21.703 


5.203 


16.177 


543 


ALA77 


O 


-21.033 


4.194 


16.42 


544 


ALA77 


CB 


-23.22 


4.212 


14.465 


545 


ARG78 


N 


-21.218 


6.431 


16.271 


546 


ARG78 


CA 


-19.868 


6.689 


16.762 


547 


ARG78 


C 


-19.868 


7.178 


18.215 


548 


ARG78 


O 


-18.816 


7.163 


18.865 


549 


ARG78 


CB 


-19.274 


7.772 


15.874 


550 


ARG78 


CG 


-19.445 


7.436 


14.398 


551 


ARG78 


CD 


-19.068 


8.629 


13.528 


552 


ARG78 


NE 


-19.848 


9.81 


13.932 


553 


ARG78 


CZ 


-19.36 


11.053 


13.921 


554 


ARG78 


NH1 


-18.114 


11.278 


13.497 


555 


ARG78 


NH2 


-20.12 


12.072 


14.33 


556 


GLN79 


N 


-21.028 


7.577 


18.722 


557 


GLN79 


CA 


-21.128 


8.129 


20.088 


558 


GLN79 


C 


-22.484 


7.818 


20.715 


559 


GLN79 


O 


-23.48 


8.503 


20.45 


560 


GLN79 


CB 


-20.937 


9.651 


20.09 


561 


GLN79 


CG 


-19.486 


10.085 


19.884 


562 


GLN79 


CD 


-19.353 


11.607 


19.931 


563 


GLN79 


OE1 


-19.071 


12.193 


20.986 


564 


GLN79 


NE2 


-19.508 


12.226 


18.773 


565 


ILE80 


N 


-22.504 


6.806 


21.562 


566 


ILE80 


CA 


-23.733 


6.44 


22.273 


567 


IUE80 


C 


-23.732 


7.034 


23.679 


568 


ILE80 


O 


-22.666 


7.316 


24.24 


569 


ILE80 


CB 


-23.847 


4.919 


22.333 


570 


ILE80 


CG1 


-22.684 


4.305 


23.109 


571 


ILE80 


CG2 


-23.905 


4.35 


20.92 


572 


ILE80 


CD1 


-22.794 


2.788 


23.191 


573 


ARG81 


N 


-24.932 


7.278 


24.188 


574 


ARG81 


CA 


-25.15 


7.84 


25.535 


575 


ARG81 


C 


-24.657 


9.276 


25.691 


576 


ARG81 


O 


-23.493 


9.571 


25.411 


577 


ARG81 


CB 


-24.51 


6.964 


26.603 


578 


ARG81 


CG 


-25.437 


5.843 


27.046 


579 


ARG81 


CD 


-25.685 


5.92 


28.555 


580 


ARG81 


NE 


-26.269 


7.22 


28.93 


581 


ARG81 


CZ 


-25.651 


8.095 


29.722 


582 


ARG81 


NH1 


-24.439 


7.82 


30.204 


583 


ARG81 


NR2 


-26.234 


9.257 


30.008 


584 


ARG82 


N 


-25.448 


10.076 


26.389 


585 


ARG82 


CA 


-25.192 


11.523 


26.511 


566 


ARG82 


C 


-23.872 


11.866 


27.204 


587 


ARG82 


O 


-23.108 


12.684 


26.682 


588 


ARG82 


CB 


-26.32 


12.122 


27.333 


589 


ARG82 


CG 


-27.683 


11.796 


26.74 


590 


ARG82 


CD 


-28.801 


12.301 


27.643 


591 


ARG82 


NE 


-28.71 


11.659 


28.967 


592 


ARG82 


CZ 


-28.623 


12.34 


30.114 



-66- 



WO 2004/061116 



PCT/US2003/034082 



593 


ARG82 


NH1 


-28.477 


11.689 


31.271 


594 


ARG82 


NH2 


-28.606 


13.675 


30.096 


595 


GLU83 


N 


-23.495 


11.077 


28.198 


596 


GLU83 


CA 


-22.237 


11.334 


28.909 


597 


GLU83 


C 


-21.03 


10.69 


28.227 


598 


GLU83 


O 


-19.894 


10.902 


28.657 


599 


GLU83 


CB 


-22.361 


10.828 


30.338 


600 


GLU83 


CG 


-23.385 


11.651 


31.114 


601 


GLU83 


CD 


-23.478 


11.172 


32.56 


602 


GLU83 


OE1 


-23.428 


9.967 


32.761 


603 


GLU83 


OE2 


-23.71 


12.011 


33.418 


604 


ASP84 


N 


-21.274 


9.941 


27.165 


605 


ASP84 


CA 


-20.201 


9.327 


26.386 


606 


ASP84 


C 


-20.095 


10.012 


25.024 


607 


ASP84 


O 


-19.257 


9.646 


24.19 


608 


ASP84 


CB 


-20.481 


7.841 


26.237 


609 


ASP84 


CG 


-20.585 


7.191 


27.613 


610 


ASP84 


OD1 


-19.547 


6.906 


28.193 


611 


ASP84 


OD2 


-21.704 


7.048 


28.092 


612 


LYS85 


N 


-20.939 


11.017 


24.831 


613 


LYS85 


CA 


-20.846 


11.928 


23.681 


614 


LYS85 


C 


-19.997 


13.228 


23.804 


615 


LYS85 


O 


-20.236 


14.084 


22.942 


616 


LYS85 


CB 


-22.27 


12.347 


23.327 


617 


LYS85 


CG 


-23.107 


11.173 


22.832 


618 


LYS85 


CD 


-24.573 


11.567 


22.679 


619 


LYS85 


CE 


-25.408 


10.408 


22.148 


620 


LYS85 


NZ 


-26.824 


10.785 


22.036 


621 


PR086 


N 


-19.054 


13.455 


24.73 


622 


PRO86 


CA 


-18.316 


14.731 


24.698 


623 


PR086 


C 


-17.168 


14.8 


23.679 


624 


PR086 


O 


-16.339 


15.713 


23.772 


625 


PRO86 


CB 


-17.779 


14.922 


26.08 


626 


PR086 


CG 


-17.866 


13.6 


26.815 


627 


PR086 


CD 


-18.543 


12.646 


25.852 


628 


PHE87 


N 


-17.14 


13.906 


22.701 


629 


PHE87 


CA 


-16.12 


13.963 


21.653 


630 


PHE87 


C 


-16.67 


14.817 


20.51 


631 


PHE87 


O 


-15.963 


15.16 


19.559 


632 


PHE87 


CB 


-15.848 


12.556 


21.128 


633 


PHE87 


CG 


-15.724 


11.449 


22.174 


634 


PHE87 


CD1 


-16.447 


10.277 


21.996 


635 


PHE87 


CD2 


-14.904 


11.591 


23.286 


636 


PHE87 


CE1 


-16.358 


9.254 


22.93 


637 


PHE87 


CE2 


-14.817 


10.567 


24.22 


638 


PHE87 


CZ 


-15.544 


9.399 


24.044 


639 


ARG88 


N 


-17.948 


15.143 


20.627 


640 


ARG88 


CA 


-18.629 


16.037 


19.686 


641 


ARG88 


C 


-18.178 


17.519 


19.7 


642 


ARG88 


O 


-18.118 


18:064 


18.59 


643 


ARG88 


CB 


-20.122 


15.915 


19.965 


644 


ARG88 


CG 


-20.964 


16.678 


18.953 


645 


ARG88 


CD 


-22.429 


16.294 


19.089 


646 


ARG88 


NE 


-22.593 


14.851 


18.868 
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647 


ARG88 


CZ 


-23.307 


14.07 


19.679 


648 


ARG88 


NH1 


-23.373 


12.757 


19.45 


649 


ARG88 


NH2 


-23.922 


14.598 


20.739 


650 


PR089 


N 


-17.919 


18.194 


20.826 


651 


PR089 


CA 


-17.186 


19.48 


20.763 


652 


PR089 


C 


-15.737 


19.333 


20.277 


653 


PR089 


0 


-14.786 


19.454 


21.057 


654 


PR089 


CB 


-17.206 


20.033 


22.154 


655 


PR089 


CG 


-17.798 


19.004 


23.096 


656 


PR089 


CD 


-18.208 


17.832 


22.224 


657 


SER90 


N 


-15.606 


19.29 


18.963 


658 


SER90 


CA 


-14.334 


19.04 


18.296 


659 


SER90 


C 


-14.43 


19.413 


16.824 


660 


SER90 


0 


-15.534 


19.592 


16.293 


661 


SER90 


CB 


-14.069 


17.55 


18.375 


662 


SER90 


OG 


-15.095 


16.928 


17.609 


663 


LEU91 


N 


-13.326 


19.173 


16.14 


664 


LEU91 


CA 


-13.154 


19.564 


14.737 


665 


LEU91 


c 


-14.042 


18.804 


13.745 


666 


LEU91 


0 


-14.491 


19.39 


12.754 


667 


LEU91 


CB 


-11.702 


19.238 


14.405 


668 


LEU91 


CG 


-11.325 


19.637 


12.988 


669 


LEU91 


CD1 


-11.253 


21.153 


12.877 


670 


LEU91 


CD2 


-9.989 


19.012 


12.606 


671 


ILE92 


N 


-14.422 


17.582 


14.076 


672 


ILE92 


CA 


-15.199 


16.791 


13.12 


673 


ILE92 


C 


-16.712 


16.857 


13.368 


674 


ILE92 


O 


-17.487 


16.497 


12.474 


675 


ILE92 


CB 


-14.682 


15.351 


13.173 


676 


ILE92 


CG1 


-15.288 


14.488 


12.07 


677 


ILE92 


CG2 


-14.932 


14.719 


14.539 


678 


ILE92 


CD1 


-14.775 


13.055 


12.137 


679 


ALA93 


N 


-17.14 


17.407 


14.494 


680 


ALA93 


CA 


-18.578 


17.414 


14.759 


681 


ALA93 


C 


-19.15 


18.822 


14.872 


682 


ALA93 


O 


-20.335 


19.048 


14.589 


683 


ALA93 


CB 


-18.865 


16.593 


16.004 


684 


MET94 


N 


-18.294 


19.776 


15.191 


685 


MET94 


CA 


-18.739 


21.168 


15.216 


686 


MET94 


C 


-18.99 


21.683 


13.811 


687 


MET94 


0 


-18.221 


21.436 


12.88 


688 


MET94 


CB 


-17.695 


22.042 


15.893 


689 


MET94 


CG 


-17.822 


21.982 


17.407 


690 


MET94 


SD 


-16.686 


23.058 


18.31 


691 


MET94 


CE 


-17.561 


23.095 


19.891 


692 


ASP95 


N 


-20.089 


22.398 


13.672 


693 


ASP95 


CA 


-20.42 


23.024 


12.393 


694 


ASP95 


C 


-19.831 


24.427 


12.371 


695 


ASP95 


O 


-19.589 


25.001 


13.438 


696 


ASP95 


CB 


-21.938 


23.082 


12.258 


697 


ASP95 


CG 


-22.52 


21.677 


12.373 


698 


ASP95 


OD1 


-22.276 


20.87 


11.484 


699 


ASP95 


OD2 


-23.173 


21.412 


13.37 


700 


PR096 


N 


-19.488 


24.936 


11.201 
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701 


PR096 


GA 


-19.076 


26.343 


11.099 


702 


PR096 


C 


-20.177 


27.263 


11.641 


703 


PR096 


O 


-21.353 


26.892 


11.581 


704 


PR096 


CB 


-18.812 


26.567 


9.64 


705 


PR096 


CG 


-19.078 


25.278 


8.875 


706 


PR096 


CD 


-19.532 


24.256 


9.905 


707 


PR097 


N 


-19.817 


28.38 


12.263 


708 


PR097 


CA 


-18.428 


28.85 


12.436 


709 


PR097 


C 


-17.649 


28.271 


13.631 


710 


PR097 


O 


-16.462 


28.595 


13.772 


711 


PR097 


CB 


-18.567 


30.329 


12.615 


712 


PR097 


CG 


-20.013 


30.649 


12.964 


713 


PR097 


CD 


-20.777 


29.345 


12.804 


714 


GLU98 


N 


-18.233 


27.353 


14.389 


715 


GLU98 


CA 


-17.555 


26.761 


15.556 


716 


GLU98 


C 


-16.353 


25.946 


15.099 


717 


GLU98 


O 


-15.224 


26.174 


15.555 


718 


GLU98 


CB 


-18.513 


25.782 


16.223 


719 


GLU98 


CG 


-19.894 


26.372 


16.474 


720 


GLU98 


CD 


-20.857 


25.236 


16.811 


721 


GLU98 


OE1 


-20.697 


24.171 


16.227 


722 


GLU98 


OE2 


-21.799 


25.49 


17.544 


723 


HIS99 


N 


-16.575 


25.223 


14.013 


724 


HIS99 


CA 


-15.52 


24.448 


13.359 


725 


HIS99 


C 


-14.43 


25.323 


12.743 


726 ' 


HIS99 


O 


-13.249 


24.993 


12.887 


727 


HIS99 


CB 


-16.19 


23.646 


12.252 


728 


HIS99 


CG 


-15.238 


23.09 


11.22 


729 


HIS99 


ND1 


-14.522 


21.957 


11.317 


730 


HIS99 


CD2 


-14.946 


23.649 


9.998 


731 


HIS99 


CE1 


-13.779 


21.805 


10.203 


732 


HIS99 


NE2 


-14.042 


22.852 


9.39 


733 


GLY100 


N 


-14.792 


26.524 


12.322 


734 


GLY100 


CA 


-13.832 


27.439 


11.702 


735 


GLY100 


C 


-12.859 


27.944 


12.756 


736 


GLY100 


O 


-11.648 


27.716 


12.64 


737 


LYS101 


N 


-13.419 


28.385 


13.872 


738 


LYS101 


CA 


-12.626 


28.895 


14.993 


739 


LYS101 


C 


-11.711 


27.823 


15.579 


740 


LYS101 


O 


-10.485 


28.02 


15.622 


741 


LYS101 


CB 


-13.608 


29.33 


16.07 


742 


LYS101 


CG 


-12.893 


29.892 


17.291 


743 


LYS101 


CD 


-13.829 


29.939 


18.492 


744 


LYS101 


CE 


-14.189 


28.531 


18.955 


745 


LYS101 


N2 


-12.986 


27.796 


19.381 


746 


ALA102 


N 


-12.251 


26.624 


15.738 


747 


ALA102 


CA 


-11.474 


25.523 


16.313 


748 


ALA102 


C 


-10.381 


25.022 


15.373 


749 


ALA102 


O 


-9J243 


24.859 


15.828 


750 


ALA102 


CB 


-12.425 


24.379 


16.645 


751 


ARG103 


N 


-10.622 


25.091 


14.074 


752 


ARG103 


CA 


-9.63 


24.64 


13.097 


753 


ARG103 


C 


-8.492 


25.644 


12.958 


754 


ARG103 


O 


-7.325 


25.236 


13.033 
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755 


ARG103 


CB 


-10.347 


24.476 


11.762 


756 


ARG103 


CG 


-9.496 


23.785 


10.705 


757 


ARG103 


CD 


-10.366 


23.455 


9.496 


758 


ARG103 


NE 


-9.651 


22.682 


8.467 


759 


ARG103 


CZ 


-9.807 


21.367 


8.287 


760 


ARG103 


NH1 


-10.493 


20.645 


9.175 


761 


ARG103 


NH2 


-9.174 


20.755 


7.285 


762 


ARG104 


N 


-8.811 


26.923 


13.087 


763 


ARG104 


CA 


-7.775 


27.957 


13.006 


764 


ARG104 


C 


-6.906 


27.966 


14.256 


765 


ARG104 


O 


-5.675 


28.083 


14.149 


766 


ARG104 


CB 


-8.442 


29.316 


12.84 


767 


ARG104 


CG 


-9.166 


29.412 


11.502 


768 


ARG104 


CD 


-9.828 


30.772 


11.319 


769 


ARG104 


NE 


-10.874 


30.999 


12.329 


770 


ARG104 


CZ 


-11.061 


32.171 


12.941 


771 


ARG104 


NH1 


-10.231 


33 188 


12.701 


772 


ARG104 


NH2 


-12.048 


32.31 


13.829 


773 


ASP105 


N 


-7.5 


27.625 


15 388 


774 


ASP105 


CA 


-6.718 


27.493 


16.616 


775 


ASP105 


c 


-5.828 


26.253 


16 594 


776 


ASP105 


0 


-4.602 


26.392 


16 716 


777 


ASP105 


CB 


-7.67 


27.427 


17 806 


778 


ASP105 


CG 


-8.198 


28.814 




779 


ASP105 


OD1 


-7.389 


29.588 


18 666 


780 


ASP105 


OD2 


-9.41 1 


28 938 


18.257 


781 


VAL106 


N 


-6.379 


25.125 


16 173 

1 V. I/O 


782 


VAL106 


CA 


-5.636 


23.86 


16 236 


783 


VAL106 


C 


-4.51 


23.761 


15.214 


784 


VAL106 


0 


-3.414 


23.348 


15.612 


785 


VAL106 


CB 


-6.611 


22.703 


16.046 


786 


VAL106 


CG1 


-5.886 


21.37 


15.89 


787 


VAL106 


CG2 


-7.587 


22.637 


17.212 


788 


VAL107 


N 


-4.641 


24.427 


14.075 


789 


VAL107 


CA 


-3.565 


24.397 


13.071 


790 


VAL107 


C 


-2.362 


25.259 


13.474 


791 


VAL107 


0 


-1.225 


24.918 


13.123 


792 


VAL107 


CB 


-4.142 


24.869 


11.737 


793 


VAL107 


CG1 


-3.06 


25.104 


10.687 


794 


VAL107 


CG2 


-5.175 


23.879 


11.213 


795 


GLY108 


N 


-2.576 


26.155 


14.426 


796 


GLY108 


CA 


-1.49 


26.985 


14.953 


797 


GLY108 


C 


-0.511 


26.183 


15.813 


798 


GLY108 


0 


0.685 


26.491 


15.837 


799 


GLU109 


N 


-1.006 


25.191 


16.537 


800 


GLU109 


CA 


-0.109 


24.388 


17.376 


801 


GLU109 


C 


0.121 


22.976 


16.836 


802 


GLU109 


O 


1.086 


22.311 


17.229 


803 


GLU109 


CB 


-0.677 


24.35 


18.784 


804 


GLU109 


CG 


-0.577 


25.728 


19.424 


805 


GLU109 


CD 


0.886 


26.1 


19.659 


806 


GLU109 


OE1 


1.612 


25.244 


20.147 


807 


GLU109 


OE2 


1.22 


27.255 


19.442 


808 


PHE110 


N 


-0.686 


22.572 


15.873 
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809 


PHE110 


CA 


-0.511 


21.269 


15.221 


810 


PHE110 


C 


0.017 


21.468 


13.798 


811 


PHE110 


0 


-0.547 


20.963 


12.819 


812 


PHE110 


CB 


-1.866 


20.568 


15.206 


813 


PHE110 


CG 


-1.834 


19.088 


14.838 


814 


PHE110 


CD1 


-0.842 


18.264 


15.352 


815 


PHE110 


CD2 


-2.808 


18.564 


13.998 


816 


PHE110 


CE1 


-0.819 


16.915 


15.019 


817 


PHE110 


CE2 


-2.784 


17.216 


13.665 


818 


PHE110 


CZ 


-1.79 


16.392 


14.176 


819 


THR111 


N 


1.103 


22.217 


13.706 


820 


THR111 


CA 


1.68 


22.585 


12.409 


821 


THR111 


C 


2.353 


21.411 


11.71 


822 


THR111 


O 


2.846 


20.47 


12.346 


823 


THR111 


CB 


2.745 


23.65 


12.628 


824 


THR111 


OG1 


3.887 


23.019 


13.193 


825 


THR111 


CG2 


2.27 


24.756 


13.561 


826 


VAL112 


N 


2.564 


21.605 


10.417 


827 


VAL112 


CA 


3.302 


20.63 


9.605 


828 


VAL112 


C 


4.802 


20.712 


9.887 


829 


VAL112 


0 


5.492 


19.689 


9.86 


830 


VAL112 


CB 


3.026 


20.929 


8.134 


831 


VAL112 


CG1 


3.819 


20.01 


7.21 


832 


VAL112 


CG2 


1.535 


20.833 


7.832 


833 


LYS113 


N 


5.227 


21.845 


10.425 


834 


LYS113 


CA 


6.608 


22.003 


10.884 


835 


LYS113 


C 


6.892 


21.096 


12.082 


836 


LYS113 


O 


7.864 


20.332 


12.044 


837 


LYS113 


CB 


6.795 


23.456 


11.298 


838 


LYS113 


CG 


8.168 


23.696 


11.914 


839 


LYS113 


CD 


8.232 


25.064 


12.582 


840 


LYS113 


CE 


7.189 


25.184 


13.692 


841 


LYS113 


N2 


7.407 


24.178 


14.747 


842 


ARG114 


N 


5.945 


21.013 


13.008 


843 


ARG114 


CA 


6.085 


20.094 


14.141 


844 


ARG114 


C 


6.098 


18.627 


13.713 


845 


ARG114 


O 


7.034 


17.912 


14.09 


846 


ARG114 


CB 


4.916 


20.313 


15.096 


847 


ARG114 


CG 


4.939 


19.283 


16.22 


848 


ARG114 


CD 


3.721 


19.388 


17.131 


849 


ARG114 


NE 


3.696 


20.666 


17.858 


850 


ARG114 


CZ 


4.078 


20.792 


19.131 


851 


ARG114 


NH1 


3.903 


21.953 


19.766 


852 


ARG114 


NH2 


4.537 


19.73 


19.798 


853 


MET115 


N 


5.28 


18.265 


12.737 


854 


MET115 


CA 


5.23 


16.862 


12.311 


855 


MET115 


C 


6.438 


16.467 


11.456 


856 


MET115 


O 


6.99 


15.378 


11.662 


857 


MET115 


CB 


3.926 


16.642 


11.555 


858 


MET115 


CG 


2.739 


16.915 


12.474 


859 


MET115 


SD 


1.093 


16.637 


11.78 


860 


MET115 


CE 


1.057 


17.94 


10.532 


861 


LYS116 


N 


7.027 


17.445 


10.787 


862 


LYS116 


CA 


8.25 


17.222 


10.01 
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863 


LYS116 


C 


9.492 


17.156 


10.902 


864 


LYS116 


0 


10.434^ 


16.413 


10.59 


865 


LYS116 


CB 


8.372 


18.392 


9.042 


866 


LYS116 


CG 


9.635 


18.337 


8.194 


867 


LYS116 


CD 


9.738 


19.592 


7.338 


868 


LYS116 


CE 


9.703 


20.841 


8.213 


869 


LYS116 


NZ 


9.753 


22.063 


7.395 


870 


ALA117 


N 


9.404 


17.748 


12.084 


871 


ALA117 


CA 


10.49 


17.663 


13.066 


872 


ALA117 


c 


10.354 


16.43 


13.962 


873 


ALA117 


o 


11.331 


16 


14.587 


874 


ALA117 


CB 


10.469 


18.924 


13.922 


875 


LEU118 


N 


9.185 


15.81 


13.933 


876 


LEU118 


CA 


8.975 


14.544 


14.64 


877 


LEU118 


c 


9.351 


13.359 


13.76 


878 


LEU118 


0 


9.591 


12.26 


14.275 


879 


LEU118 


CB 


7.512 


14.434 


15.05 


880 


LEU118 


CG 


7.153 


15.474 


16.104 


881 


LEU118 


CD1 


5.654 


15.48 


16.372 


882 


LEU118 


CD2 


7.934 


15.246 


17 393 


883 


GLN119 


N 


9.563 


13 8*32 


19 483 


884 


GLN119 


CA 


10.052 


12.633 


11 518 


885 


GLN119 


c 


11.263 


11 797 


11 989 


886 


GLN119 


o 


11 09 


10 573 


12 041 


887 


GLN119 


CB 


10.378 


13.373 


10 227 


888 


GLN119 


CG 


10.944 


12 471 


9 144 


889 


GLN119 


CD 


1 1 .394 


13.351 


7 985 


890 


GLN119 


OE1 


1 1 .701 


12.857 


6.894 


891 


GLN119 


NE2 


11.444 


14.647 


8.243 


892 


PRO120 


N 


12.388 


12.361 


12.439 


893 


PRO120 


CA 


13.485 


11.486 


12.885 


894 


PRO120 


c 


13.211 


10.732 


14.195 


895 


PRO120 


o 


13.761 


9.639 


14.381 


896 


PRO120 


CB 


14.672 


12.386 


13.05 


897 


PRO120 


CG 


14.237 


13.832 


12.892 


898 


PRO120 


CD 


12.764 


13.785 


12.533 


699 


ARG121 


N 


12.229 


11.159 


14.974 


900 


ARG121 


CA 


11.917 


10.438 


16.203 


901 


ARG121 


C 


11.02 


9.25 


15.868 


902 


ARG121 


0 


11.331 


8.136 


16.303 


903 


ARG121 


CB 


11.218 


11.385 


17.174 


904 


ARG121 


CG 


11.741 


11.209 


18.597 


905 


ARG121 


CD 


11.481 


9.812 


19.149 


906 


ARG121 


NE 


12.184 


9.613 


20.424 


907 


ARG121 


CZ 


12.714 


8.443 


20.784 


908 


ARG121 


NH1 


13.415 


8.346 


21.915 


909 


ARG121 


NH2 


12.601 


7.386 


19.977 


910 


ILE122 


N 


10.18 


9.421 


14.857 


911 


ILE122 


CA 


9.316 


8.332 


14.381 


912 


ILE122 


C 


10.135 


7.27 


13.656 


913 


ILE122 


0 


9.975 


6.073 


13.928 


914 


ILE122 


CB 


8.309 


8.918 


13.396 


915 


ILE122 


CG1 


7.456 


9.995 


14.052 


916 


ILE122 


CG2 


7.422 


7.825 


12.807 
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917 


ILE122 


CD1 


6.509 


10.636 


13.044 


916 


GLN123 


N 


11.179 


7.724 


12.982 


919 


GLN123 


CA 


12.088 


6.827 


12.269 


920 


GLN123 


C 


12.914 


5.997 


13.245 


921 


GLN123 


0 


12.897 


4.76 


13.156 


922 


GLN123 


CB 


12.989 


7.717 


11.423 


923 


GLN123 


CG 


13.978 


6.941 


10.567 


924 


GLN123 


CD 


14.72 


7.939 


9.684 


925 


GLN123 


OE1 


15.954 


7.96 


9.626 


926 


GLN123 


NE2 


13.946 


8.8 


9.044 


927 


GLN124 


N 


13.295 


6.633 


14.34 


926 


GLN124 


CA 


14.049 


5.959 


15.394 


929 


GLN124 


c 


13.184 


4.947 


16.144 


930 


GLN124 


0 


13.621 


3.798 


16.284 


931 


GLN124 


CB 


14.544 


7.04 


16.345 


932 


GLN124 


CG 


15.429 


6.495 


17.455 


933 


GLN124 


CD 


15.912 


7.668 


18.3 


934 


GLN124 


OE1 


16.786 


7.524 


19.162 


935 


GLN124 


NE2 


15.357 


8.832 


18.008 


936 


ILE125 


N 


11.904 


5.252 


16.299 


937 

Owl 


ILE125 


CA 


10.961 


4.328 


16.947 


QQQ 


ILE125 


c 


10.716 


3.072 


16.112 


939 


ILE125 


o 


10.861 


1.961 


16.642 


940 


1LE125 


CB 


9 638 


5.064 


17.148 


941 


ILE125 


CG1 


9.792 


6.214 


18.13 


942 


ILE125 


CG2 


8.543 


4.122 


17.628 


943 


ILE125 


CD1 


8 487 


6.984 


18.275 


944 


VAL126 


N 


10.649 


3.229 


14.797 


945 


VAL126 


CA 


10.44 


2.064 


13.928 


946 


VAL126 


c 


11.693 


1.2 


13.87 


947 


VAL126 


o 


11.603 


-0.021 


14.062 


948 


VAL126 


CB 


10.119 


2.529 


12.513 


949 


VAL126 


CG1 


9.754 


1.334 


11.641 


950 


VAL126 


CG2 


8.988 


3.544 


12.503 


951 


ASP127 


N 


12.843 


1.855 


13.909 


952 


ASP127 


CA 


14.121 


1.141 


13.889 


953 


ASP127 


C 


14.314 


0.34 


15.17 


954 


ASP127 


0 


14.537 


-0.876 


15.088 


955 


ASP127 


CB 


15.258 


2.153 


13.769 


956 


ASP127 


CG 


15.158 


2.967 


12.481 


957 


ASP127 


OD1 


15.632 


4.097 


12.49 


958 


ASP127 


OD2 


14.686 


2.426 


11.489 


959 


GLU128 


N 


13.903 


0.919 


16.288 


960 


GLU128 


CA 


14.048 


0.26 


17.589 


961 


GLU128 


C 


13.094 


-0.915 


17.762 


962 


GLU128 


0 


13.527 


-1.952 


18.281 


963 


GLU128 


CB 


13.764 


1.281 


18.684 


964 


GLU128 


CG 


14.807 


2.39 


18.707 


965 


GLU128 


CD 


14.367 


3.489 


19.668 


966 


GLU128 


OE1 


13.584 


4.333 


19.247 


967 


GLU128 


OE2 


14.794 


3.452 


20.812 


968 


HIS129 


N 


11.934 


-0.861 


17.128 


969 


HIS129 


CA 


11.002 


-1.985 


17.237 


970 


HIS129 


C 


11.411 


-3.142 


16.333 
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971 


HIS129 


O 


11.344 


-4.297 


16.772 


972 


HIS129 


CB 


9.592 


-1.533 


16.885 


973 


HIS129 


CG 


8.963 


-0.57 


17.87 


974 


HIS129 


ND1 


7.942 


0.266 


17.612 


975 


HIS129 


CD2 


9.3 


-0.394 


19.192 


976 


HIS129 


CE1 


7.647 


0.969 


'18.724 


977 


HIS129 


NE2 


8.488 


0.561 


19.701 


978 


ILE130 


N 


1Z061 


-2.848 


15.218 


979 


ILE130 


CA 


12.564 


-3.95 


14.394 


980 


ILE130 


C 


13.768 


-4.577 


15.089 


981 


1LE130 


O 


13.69 


-5.756 


15.459 


982 


ILE130 


CB 


12.968 


-3.449 


13.012 


983 


ILE130 


CG1 


11.841 


-2.659 


12.36 


984 


ILE130 


CG2 


13.341 


-4.632 


12.125 


985 


ILE130 


CD1 


12.258 


-2.117 


10.996 


986 


ASP131 


N 


14.65 


-3.712 


15.575 


987 


ASP131 


CA 


15.874 


-4.12 


16.283 


988 


ASP131 


C 


15.604 


-5.033 


17.473 


989 


ASP131 


O 


15.932 


-6.226 


17.435 


990 


ASP131 


CB 


16.565 


-2.874 


16.84 


991 


ASP131 


CG 


17.175 


-1.999 


15.749 


992 


ASP131 


OD1 


17.222 


-0.791 


15.952 


993 


ASP131 


OD2 


17.743 


-2.564 


14.826 


994 


ALA132 


N 


14.882 


-4.505 


18.448 


995 


ALA132 


CA 


14.708 


-5.198 


19.727 


996 


ALA132 


C 


13.582 


-6.228 


19.763 


997 


ALA132 


O 


13.489 


-6.983 


20.738 


998 


ALA132 


CB 


14.465 


-4.147 


20.803 


999 


LEU133 


N 


12.752 


-6.286 
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25.784 


1773 


LEU232 


CD1 


-7.057 


20.385 


26.124 


1774 


LEU232 


CD2 


-7.256 


22.293 


24.486 


1775 


VAL233 


N 


-10.877 


18.804 


25.419 


1776 


VAL233 


CA 


-11.016 


17.36 


25.627 


1777 


VAL233 


C 


-11.406 


16.646 


24.331 


1778 


VAL233 


O 


-10.729 


15.676 


23.963 


1779 


VAL233 


CB 


-12.066 


17.13 


26.71 


1780 


VAL233 


CG1 


-12.478 


15.667 


26.802 
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1781 


VAL233 


CG2 


-11.571 


17.632 


28.062 


1782 


GLY234 


N 


-12.258 


17.278 


23.536 


1783 


GLY234 


CA 


-12.641 


16.729 


22.227 


1784 


GLY234 


C 


-11.443 


16.59 


21.287 


1785 


GLY234 


0 


-11.116 


15.477 


20.849 


1786 


LEU235 


N 


-10.687 


17.668 


21.146 


1787 


LEU235 


CA 


-9.532 


17.691 


20.238 


1788 


LEU235 


C 


-8.375 


16.809 


20.702 


1789 


LEU235 


0 


-7.846 


16.046 


19.882 


1790 


LEU235 


CB 


-9.04 


19.13 


20.146 


1791 


LEU235 


CG 


-10.08 


20.034 


19.495 


1792 


LEU235 


CD1 


-9.781 


21.505 


19.761 


1793 


LEU235 


CD2 


-10.2 


19.75 


18.003 


1794 


ALA236 


N 


-8.162 


16.713 


22.006 


1795 


ALA236 


CA 


-7.065 


15.885 


22.523 


1796 


ALA236 


c 


-7.38 


14.407 


22.375 


1797 


ALA236 


0 


-6.525 


13.633 


21.922 


1798 


ALA236 


CB 


-6.861 


16.193 


24.002 


1799 


PHE237 


N 


-8.66 


14.095 


22.475 


1800 


PHE237 


CA 


-9.11 


12.723 


22.306 


1801 


PHE237 


c 


-8.956 


12.273 


20.864 


1802 


PHE237 


0 


-8.27 


11.274 


20.617 


1803 


PHE237 


CB 


-10.58 


12.657 


22.682 


1804 


PHE237 


CG 


-11.12 


11.24 


22.674 


1805 


PHE237 


CD1 


-10.824 


10.394 


23.733 


1806 


PHE237 


CD2 


-11.885 


10.786 


21.608 


1807 


PHE237 


CE1 


-11.305 


9.095 


23.736 


1808 


PHE237 


CE2 


-12.366 


9.486 


21.61 


1809 


PHE237 


CZ 


-12.076 


8.644 


22.676 


1810 


LEU238 


N 


-9.329 


13.135 


19.931 


1811 


LEU238 


CA 


-9.272 


12.754 


18.516 


1812 


LEU238 


C 


-7.845 


12.681 


17.984 


1813 


LEU238 


0 


-7.532 


11.745 


17.236 


1814 


LEU238 


CB 


-10.056 


13.766 


17.695 


1815 


LEU238 


CG 


-11.539 


13.75 


18.042 


1816 


LEU238 


CD1 


-12.279 


14.795 


17.221 


1817 


LEU238 


CD2 


-12/145 


12.369 


17.814 


1818 


LEU239 


N 


-6.947 


13.467 


18.554 


1819 


LEU239 


CA 


-5.551 


13.397 


18.122 


1820 


LEU239 


C 


-4.861 


12.158 


18.68 


1821 


LEU239 


O 


-4.202 


11.444 


17.913 


1822 


LEU239 


CB 


-4.821 


14.652 


18.586 


1823 


LEU239 


CG 


-5.364 


15.898 


17.894 


1824 


LEU239 


QD1 


-4.722 


17.162 


18.454 


1825 


LEU239 


CD2 


-5.169 


15.82 


16.384 


1826 


LEU240 


N 


-5.275 


11.736 


19.864 


1827 


LEU240 


CA 


-4.667 


10.56 


20.485 


1828 


LEU240 


C 


-5.217 


9.265 


19.877 


1829 


LEU240 


O 


-4.445 


8.316 


19.674 


1830 


LEU240 


CB 


-4.952 


10.634 


21.981 


1831 


LEU240 


CG 


-3.966 


9.808 


22.798 


1832 


LEU240 


CD1 


-2.53 


10.2 


22.479 


1833 


LEU240 


GD2 


-4.227 


9.976 


24.289 


1834 


ILE241 


N 


-6.425 


9.337 


19.333 
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1835 


ILE241 


CA 


-7.027 


8.197 


18.621 


1836 


ILE241 


C 


-6.509 


8.091 


17.184 


1837 


1LE241 


O 


-6.406 


6.98 


16.64 


1838 


ILE241 


CB 


-8.544 


8.391 


18.599 


1839 


ILE241 


CG1 


-9.122 


8.403 


20.008 


1840 


ILE241 


CG2 


-9.233 


7.311 


17.771 


1841 


1LE241 


CD1 


-8.939 


7.06 


20.699 


1842 


ALA242 


N 


-5.959 


9.182 


16.676 


1843 


ALA242 


CA 


-5.353 


9.154 


15.345 


1844 


ALA242 


C 


-4.049 


8.363 


15.361 


1845 


ALA242 


O 


-3.842 


7.511 


14.487 


1846 


ALA242 


CB 


-5.09 


10.583 


14.884 


1847 


GLY243 


N 


-3.34 


8.426 


16.473 


1848 


GLY243 


CA 


-2.135 


7.612 


16.614 


1849 


GLY243 


C 


-2.461 


6.175 


17.012 


1850 


GLY243 


0 


-2.095 


5.238 


16.289 


1851 


HIS244 


N 


-3.274 


6.021 


18.047 


1852 


HIS244 


CA 


-3.556 


4.687 


18 603 


1853 


HIS244 . 


c 


-4.424 


3.78 


17.729 


1854 


HIS244 


0 


-4.288 


2.555 


17.808 


1855 


HIS244 


CB 


-4.271 


4.846 


19.946 


1856 


HIS244 


CG 


-3.394 


5.206 


21.132 


1857 


HIS244 


ND1 


-2.995 


6.436 


21 502 


1858 


HIS244 


CD2 


-2.87 


4.327 


22.051 


1859 


HIS244 


CE1 


-2.232 


6.348 


22 61 


1860 


HIS244 


NE2 


-2.156 


5.042 


22.95 


1861 


GLU245 


N 


-5.298 


4.33 


1 6.905 


1862 


GLU245 


CA 


-6.157 


3.454 


16.101 


1863 


GLU245 


C 


-6.041 


3.723 


14.61 


1864 


GLU245 


O 


-5.883 


2.788 


13.812 


1865 


GLU245 


CB 


-7.616 


3.66 


16.504 


1866 


GLU245 


CG 


-7.895 


3.313 


17.963 


1867 


GLU245 


CD 


-7.613 


1.839 


18.255 


1868 


GLU245 


OE1 


-7.721 


1.028 


17.348 


1869 


GLU245 


OE2 


-7.228 


1.562 


19.381 


1870 


THR246 


N 


-5.987 


4.996 


14.26 


1871 


THR246 


CA 


-6.101 


5.374 


12.849 


1872 


THR246 


C 


-4.863 


4.962 


12.066 


1873 


THR246 


0 


-4.949 


4.04 


11.244 


1874 


THR246 


CB 


-6.314 


6.88 


12.759 


1875 


THR246 


OG1 


-7.415 


7.225 


13.59 


1876 


THR246 


CG2 


-6.614 


7.344 


11.338 


1877 


THR247 


N 


-3.702 


5.384 


12.532 


1878 


THR247 


CA 


-2.48 


5.072 


11.792 


1879 


THR247 


C 


-2.013 


3.634 


12.044 


1880 


THR247 


0 


-1.437 


3.032 


11.132 


1881 


THR247 


CB 


-1.4 


6.078 


12.171 


1882 


THR247 


OG1 


-1.937 


7.39 


12.062 


1883 


THR247 


CG2 


-0.192 


5.981 


11.246 


1884 


ALA248 


N 


-2.533 


3.007 


13.09 


1885 


ALA248 


CA 


-2.225 


1.596 


13.35 


1886 


ALA248 


C 


-2.915 


0.676 


12.342 


1887 


ALA248 


O 


-2.236 


-0.089 


11.643 


1888 


ALA248 


CB 


-2.678 


1.25 


14.763 
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1889 


ASN249 


N 


-4.175 


0.958 


12.049 


1890 


ASN249 


CA 


-4.878 


0.157 


11.042 


1891 


ASN249 


C 


-4.453 


0.517 


9.624 


1892 


ASN249 


0 


-4.313 


-0.39 


8.791 


1893 


ASN249 


CB 


-6.377 


0.361 


11.199 


1894 


ASN249 


CG 


-6.902 


-0.561 


12.291 


1895 


ASN249 


0D1 


-6.261 


-1.569 


12.613 


1896 


ASN249 


ND2 


-8.133 


-0.32 


12.703 


1897 


MET250 


N 


-3.945 


1.727 


9.463 


1898 


MET250 


CA 


-3.472 


2.183 


8.159 


1899 


MET250 


C 


-2.137 


1.529 


7.787 


1900 


MET250 


0 


-2.018 


1.009 


6.669 


1901 


MET250 


CB 


-3.231 


3.702 


8.255 


1902 


MET250 


CG 


-3.343 


4.33 


6.851 


1903 


MET250 


SD 


-2.234 


5.768 


6.665 


1904 


MET250 


CE 


-1.954 


6.241 


8.406 


1905 


ILE251 


N 


-1.267 


1.317 


8.764 


1906 


ILE251 


CA 


0.019 


0.667 


8.477 


1907 


ILE251 


C 


-0.132 


-0.844 


8.328 


1908 


ILE251 


O 


0.449 


-1.427 


7.402 


1909 


ILE251 


CB 


1.001 


0.949 


9.613 


1910 


ILE251 


CG1 


1.293 


2.434 


9.757 


1911 


ILE251 


CG2 


2.305 


0.188 


9.404 


1912 


ILE251 


CD1 


2.264 


2.698 


10.902 


1913 


SER252 


N 


-1.081 


-1.421 


9.047 


1914 • 


SER252 


CA 


-1.254 


-2.875 


8.973 


1915 


SER252 


C 


-1.947 


-3.301 


7.679 


1916 


SER252 


0 


-1.444 


-4.204 


6.998 


1917 


SER252 


CB 


-2.03 


-3.36 


10.197 


1918 


SER252 


OG 


-3.28 


-2.684 


10.279 


1919 


LEU253 


N 


-2.85 


-2.465 


7.194 


1920 


LEU253 


CA 


-3.558 


-2.762 


5.948 


1921 


LEU253 


C 


-2.699 


-2.386 


4.739 


1922 


LEU253 


O 


-2.665 


-3.129 


3.748 


1923 


LEU253 


CB 


-4.857 


-1.963 


5.994 


1924 


LEU253 


■ CG 


-5.781 


-2.185 


4.804 


1925 


LEU253 


CD1 


-6.009 


-3.665 


4.521 


1926 


LEU253 


' CD2 


-7.109 


-1.479 


5.058 


1927 


GLY254 


N 


-1.818 


-1.42 


4.948 


1928 


GLY254 


CA 


-0.841 


-1.031 


3.933 


1929 


GLY254 


C 


0.161 


-2.148 


3.671 


1930 


GLY254 


O 


0.267 


-2.62 


2.531 


1931 


THR255 


N 


0.707 


-2.706 


4.742 


1932 


THR255 


CA 


1.711 


-3.772 


4.62 


1933 


THR255 


C 


1.123 


-5.042 


4.017 


1934 


THR255 


O 


1.715 


-5.592 


3.079 


1935 


THR255 


CB 


2.255 


-4.105 


6.007 


1936 


THR255 


QG1 


2.837 


-2.935 


6.563 


1937 


THR255 


CG2 


3.334 


-5.18 


5.938 


1938 


VAL256 


N 


-0.133 


-5.321 


4.331 


1939 


VAL256 


CA 


-0.797 


-6.505 


3.781 


1940 


VAL256 


C 


-1.11 


-6.37 


2.291 


1941 


VAL256 


O 


-0.832 


-7.307 


1.531 


1942 


VAL256 


CB 


-2.083 


-6.706 


4.571 
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1943 


VAL256 


CG1 


-3.028 


-7.693 


3.905 


1944 


VAL256 


CG2 


-1.774 


-7.144 


5.993 


1945 


THR257 


N 


-1.372 


-5.158 


1 833 


1946 


THR257 


CA 


-1.675 


-4.982 


0 413 


1947 


THR257 


C 


-0.401 


-4.902 


-0.427 


1948 


THR257 


o 


-0.357 


-5.499 


-1 512 

1 .V? IC 


1949 


THR257 


CB 


-2.502 


-3.717 


0 94ft 


1950 


THR257 


0G1 


-3.63 


-3.818 


1 106 
1.1 uu 


1951 


THR257 


CG2 


-3.002 


-3.558 


-1 183 
1.1 00 


1952 


LEU258 


N 


0.69 


-4 465 


v/. 1 OO 


1953 


LEU258 


CA 


1.976 


-4 434 




1954 


LEU258 


c 


2.588 


-5 R9ft 

w.U<.U 


-O R1 


1955 


LEU258 


o 


3 147 


-ft 205 


-1 ft 4ft 

1 .OHO 


1956 


LEU258 


CB 


2 936 


0-0&"r 


ft 933 


1957 


LEU258 


CG 




-9 HA 


ft 9ft1 


1958 


LEU258 


CD1 

WW 1 


3 39Q 

O.O^ 57 


-1 99Q 


1 179 
1*1/ c. 


1959 


LFU258 

UUwCwU 


CD2 


2 3Q1 


-1 409 


-1 141 


1960 


I FU25Q 


N 


9 94A 


jp. ceo 


U.000 


1 961 


I PI 1259 






-ft n^4 


ft 4ft7 


1962 


LEIJ25Q 




1 A1A 
I.O ID 


-o.yoo 


-ft A(X7 


1963 

1 www 


LFU259 




9 99Q 


- IU.UOO 


-ft "747 


1964 


LEU259 


HR 


9 749 


O yfOO 


1 ftftQ 
1 .009 


1965 


1 FU259 


ova 


07Q 

0.57/ 






1966 


LFU25Q 


v/U 1 




-o.ooy 


1 797 


1967 


1 FU95Q 




a ft*; 


Q R1 
-57.01 


1 CZ70 


1968 


GLU260 




ft ft4Q 


-ft 47** 
-o.*f to 


-ft 7Q1 
-U. / 57 1 


1969 


gi UPRn 






-Q 91 ^ 


1 7ft7 


1970 


GLlJ9fift 




ft 1fl*3 


-ft o/*9 


-Q 1 K1 


1971 






n ft^ 


-Q ftft4 


-4 ft1Q 
-*t.U iy 


1972 


GLU260 


CR 


-1 fi7 

- 1 .O/ 


-ft 773 
-0. / / o 


-1 531 


1973 


GLU260 


CG 


-9 *TO 


-0 ft* 


-ft ftftft 
-u.000 


1974 


GLU260 


CD 


-9 449 


-Q 7ft 
57./ 0 


ft ftft7 


1975 


GLU260 

L_\JCWv 


OF1 

wU 1 




-in 4Qft 


1 9ft4 


1976 


GLU260 


OE2 


_Q 9«7 


-57. 1 uo 


1 9ft9 


1977 


ASN261 


N 


0 693 


-7 743 


~3 382 


1978 


ASN261 


CA 


1 158 

I.I w 


-7 381 


-4 723 


1979 


ASN261 


c 


2.669 


-7.181 


-4 742 


1980 


ASN261 


0 


3.134 


-6.032 


-4.804 


1981 


ASN261 


CB 


0.468 


-6.089 


-5.161 


1982 


ASN261 


CG 


-0.986 


-6.311 


-5.586 


1983 


ASN261 


0D1 


-1.784 


-6.942 


■4.881 


1984 


ASN261 


ND2 


-1.339 


-5.701 


-6.705 


1985 


PR0262 


N 


3.404 


-8.259 


-4.986 


1986 


PR0262 


CA 


4.864 


-8.219 


-4.846 


1987 


PR0262 


c 


5.563 


-7.432 


-5.958 


1988 


PR0262 


0 


6.612 


-6.834 


-5.706 


1989 


PR0262 


CB 


5.298 


-9.652 


-4.871 


1990 


PR0262 


CG 


4.108 


-10.533 


-5.223 


1991 


PR0262 


CD 


2.912 


-9.601 


-5.319 


1992 


ASP263 


N 


4.884 


-7.222 


-7.077 


1993 


ASP263 


CA 


5.442 


-6.411 


-8.166 


1994 


ASP263 


C 


5.27 


-4.909 


-7.918 


1995 


ASP263 


O 


6.133 


-4.124 


-8.327 


1996 


ASP263 


CB 


4.783 


-6.822 


-9.488 
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1997 


ASP263 


CG 


3.253 


-6.778 


-9.42 


1998 


ASP263 


OD1 


2.69 


-5.734 


-9.721 


1999 


ASP263 


OD2 


2.673 


-7.766 


-8.992 


2000 


GLN264 


N 


4.358 


-4.558 


-7.024 


2001 


GLN264 


CA 


4.149 


-3.154 


-6.675 


2002 


GLN264 


C 


5.104 


-2.803 


-5.545 


2003 


GLN264 


0 


5.782 


-1.769 


-5.59 


2004 


GLN264 


CB 


2.709 


-3.002 


-6.206 


2005 


GLN264 


CG 


1.723 


-3.483 


-7.265 


2006 


GLN264 


CD 


1.637 


-2.485 


-8.412 


2007 


GLN264 


OE1 


1.631 


-1.274 


-8.178 


2008 


GLN264 


NE2 


1.592 


-2.991 


-9.63 


2009 


LEU265 


N 


5.398 


-3.825 


-4.758 


2010 


LEU265 


CA 


6.389 


-3.701 


-3.693 


2011 


LEU265 


C 


7.808 


-3.677 


-4.263 


2012 


LEU265 


0 


8.639 


-2.888 


-3.798 


2013 


LEU265 


CB 


6.218 


-4.903 


-2.775 


2014 


LEU265 


CG 


7.134 


-4.809 


-1.565 


2015 


LEU265 


CD1 


6.869 


-3.518 


-0.802 


2016 


LEU265 


CD2 


6.958 


-6.023 


-0.662 


2017 


ALA266 


N 


7.993 


-4.309 


-5.411 


2018 


ALA266 


CA 


9.286 


-4.267 


-6.093 


2019 


ALA266 


C 


9.528 


-2.929 


-6.785 


2020 


ALA266 


O 


10.66 


-2.434 


-6.737 


2021 


ALA266 


CB 


9.333 


-5.39 


-7.123 


2022 


LYS267 


N 


8.466 


-2.235 


-7.167 


2023 


LYS267 


CA 


8.641 


-0.891 


-7.725 


2024 


LYS267 


C 


8.887 


0.136 


-6.625 


2025 


LYS267 


O 


9.706 


1.042 


-6.818 


2026 


LYS267 


CB 


7.406 


-0.501 


-8.523 


2027 


LYS267 


CG 


7.223 


-1.394 


-9.742 


2028 


LYS267 


CD 


6.072 


-0.894 


-10.604 


2029 


LYS267 


CE 


4.779 


-0.825 


-9.803 


2030 


LYS267 


NZ 


3.688 


-0.267 


-10.615 


2031 


ILE268 


N 


8.413 


-0.162 


-5.427 


2032 


ILE268 


CA 


8.717 


0.664 


-4.254 


2033 


ILE268 


C 


10.181 


0.531 


-3.84 


2034 


ILE268 


O 


10.879 


1.544 


-3.697 


2035 


ILE268 


CB 


7.827 


0.171 


-3.117 


2036 


ILE268 


CG1 


6.38 


0.576 


-3.335 


2037 


ILE268 


CG2 


8.311 


0.638 


-1.75 


2038 


ILE268 


CD1 


5.504 


0.057 


-2.205 


2039 


LYS269 


N 


10.693 


-0.688 


-3.902 


2040 


LYS269 


CA 


12.068 


-0.95 


-3.463 


2041 


LYS269 


C 


13.116 


-0.675 


-4.542 


2042 


LYS269 


O 


14.306 


-0.554 


-4.228 


2043 


LYS269 


CB 


12.126 


-2.405 


-3.021 


2044 


LYS269 


CG 


11.167 


-2.628 


-1.858 


2045 


LYS269 


CD 


10.997 


-4.107 


-1.542 


2046 


LYS269 


CE 


12.315 


-4.756 


-1.148 


2047 


LYS269 


NZ 


12.106 


-6.181 


-0.856 


2048 


ALA270 


N 


12.679 


-0.53 


-5.782 


2049 


ALA270 


CA 


13.585 


-0.106 


-6.851 


2050 


ALA270 


C 


13.478 


1.396 


-7.101 



-93- 



WO 2004/061116 



PCT/US2003/034082 



2051 


ALA270 


O 


14.286 


1.974 


-7.838 


2052 


ALA270 


CB 


13.233 


-0.863 


-8.125 


2053 


ASP271 


N 


12.486 


2.017 


-6.486 


2054 


ASP271 


CA 


12.271 


3.453 


-6.649 


2055 


ASP271 


C 


11.505 


4.006 


-5.448 


2056 


ASP271 


0 


10.267 


4.02 


-5.445 


2057 


ASP271 


CB 


11.48 


3.653 


-7.944 


2058 


ASP271 


CG 


11.354 


5.125 


-8.337 


2059 


ASP271 


0D1 


10.975 


5.919 


-7.482 


2060 


ASP271 


0D2 


11.493 


5.405 


-9.517 


2061 


PR0272 


N 


12.238 


4.705 


-4.592 


2062 


PR0272 


CA 


11.686 


5.248 


-3.337 


2063 


PR0272 


C 


10.72 


6.438 


-3.497 


2064 


PR0272 


O 


9.973 


6.74 


-2.558 


2065 


PR0272 


CB 


12.887 


5.657 


-2.54 


2066 


PR0272 


CG 


14.133 


5.544 


-3.406 


2067 


PR0272 


CD 


13.673 


4.968 


-4.734 


2068 


GLY273 


N 


10.586 


6.971 


-4.702 


2069 


GLY273 


CA 


9.607 


8.03 


-4.968 


2070 


GLY273 


C 


8.207 


7.423 


-5.037 


2071 


GLY273 


0 


7.257 


7.968 


-4.456 


2072 


LYS274 


N 


8.167 


6.171 


-5.473 


2073 


LYS274 


CA 


6.916 


5.418 


-5.566 


2074 


LYS274 


C 


6.39 


4.954 


-4.21 


2075 


LYS274 


O 


5.225 


4.55 


-4.144 


2076 


LYS274 


CB 


7.138 


4.184 


-6.431 


2077 


LYS274 


CG 


7.528 


4.547 


-7.856 


2078 


LYS274 


CD 


7.755 


3.287 


-8.681 


2079 


LYS274 


CE 


8.177 


3.615 


-10.107 


2080 


LYS274 


NZ 


8.465 


2.383 


-10.858 


2081 


THR275 


N 


7.12 


5.182 


-3.127 


2082 


THR275 


CA 


6.593 


4.814 


-1.813 


2083 


THR275 


C 


5.522 


5.804 


-1.352 


2084 


THR275 


O 


4.516 


5.354 


-0.798 


2085 


THR275 


CB 


7.725 


4.76 


-0.789 


2086 


THR275 


OG1 


8.169 


6.074 


-0.485 


2087 


THR275 


CG2 


8.912 


3.963 


-1.305 


2088 


LEU276 


N 


5.564 


7.036 


-1.844 


2089 


LEU276 


CA 


4.543 


8.012 


-1.446 


2090 


LEU276 


C 


3.312 


7.905 


-2.346 


2091 


LEU276 


O 


2.175 


8.051 


-1.876 


2092 


LEU276 


CB 


5.14 


9.411 


-1.538 


2093 


LEU276 


CG 


4.182 


10.462 


-0.987 


2094 


LEU276 


CD1 


3.836 


10.177 


0.472 


2095 


LEU276 


CD2 


4.77 


11.861 


-1.133 


2096 


ALA277 


N 


3.53 


7.375 


-3.539 


2097 


ALA277 


CA 


2.417 


7.126 


-4.451 


2098 


ALA277 


C 


1.711 


5.836 


-4.052 


2099 


ALA277 


O 


0.475 


5.796 


-4.026 


2100 


ALA277 


CB 


2.963 


7.021 


-5.869 


2101 


ALA278 


N 


2.472 


4.947 


-3.431 


2102 


ALA278 


CA 


1.909 


3.727 


-2.859 


2103 


ALA278 


C 


1.12 


4.01 


-1.591 


2104 


ALA278 


O 


0.051 


3.416 


-1.427 



-94- 



WO 2004/061116 



PCT/US2003/034082 



2105 


ALA278 


CB 


3.048 


2.773 


-2.523 


2106 


ILE279 


N 


1.472 


5.064 


-0.867 


2107 


ILE279 


CA 


0.698 


5.447 


0.324 


2108 


ILE279 


C 


-0.692 


5.926 


-0.078 


2109 


ILE279 


0 


-1.691 


5.368 


0.396 


2110 


ILE279 


CB 


1.401 


6.588 


1.063 


2111 


ILE279 


CG1 


2.806 


6.209 


1.513 


2112 


ILE279 


CG2 


0.577 


7.041 


2.264 


2113 


ILE279 


CD1 


2.805 


4.984 


2.416 


2114 


GLU280 


N 


-0.748 


6.688 


-1.159 


2115 


GLU280 


CA 


-2.037 


7.217 


-1.616 


2116 


GLU280 


c 


-2.849 


6.159 


-2.358 


2117 


GLU280 


o 


-4.075 


6.123 


-2.217 


2118 


GLU280 


CB 


-1.784 


8.404 


-2.537 


2119 


GLU280 


CG 


-0.943 


9.484 


-1 862 


2120 


GLU280 


CD 


-1.616 


9 992 


-n *?87 


2121 


GLU280 


OE1 


-2.453 


10 875 




2122 


GLU280 


OE2 


-1 19 




0 479 


2123 


GLU281 


N 


-2 169 


5 17^ 


-P Q1R 


2124 


GLU281 


CA 




4 OR^ 




2125 


GLU281 


c 






-P 61ft 


2126 


GLU281 


o 


-4 4Q7 


P *%14 


-P 84P 


2127 


GLU281 


CB 


-1 846 


Q AAA 
0.*r*r 1 


A CAR 


2128 


GLU281 


CG 


-p 4m 


P ^PQ 


-\J.057 


2129 


GLU281 


CD 


"O.057sJ 


P fifiP 




2130 


GLU281 


OE1 


_q QQO 

0.0570 


p ni7 


-7 1QQ 


2131 


GLU281 


OE2 


-3.446 


4 068 


-6 6R3 


2132 


LEU282 


N 


_p 77fi 

Cm* t 1 \J 


2 957 


-1 4^P 


2133 


LEU282 


CA 


-3 272 


P 10P 




2134 


LEU282 


C 


-4.527 


2.703 


0 P^P 


2135 


LEU282 


o 


-5.563 


2.03 


0 269 


2136 


LEU282 


CB 


-2.212 


2.019 


0 727 


2137 


LEU282 


CG 


-1.025 


1.151 


0.332 


2138 


LEU282 


CD1 


0.163 


1.385 


1.256 


2139 


LEU282 


CD2 


-1.415 


-0.319 


0.305 


2140 


LEU283 


N 


-4.511 


4.018 


0.378 


2141 


LEU283 


CA 


-5.666 


4.738 


0.922 


2142 


LEU283 


c 


-6.823 


4.808 


-0.066 


2143 


LEU283 


O 


-7.989 


4.766 


0.346 


2144 


LEU283 


CB 


-5.202 


6.145 


1.257 


2145 


LEU283 


CG 


-4.178 


6.103 


2.379 


2146 


LEU283 


CD1 


-3.386 


7.399 


2.461 


2147 


LEU283 


CD2 


-4.848 


5.766 


3.706 


2148 


ARG284 


N 


-6.512 


4.723 


-1.346 


2149 


ARG284 


CA 


-7.561 


4.67 


-2.355 


2150 


ARG284 


C 


-8.306 


3.337 


-2.3 


2151 


ARG284 


O 


-9.482 


3.316 


-1.923 


2152 


ARG284 


CB 


-6.921 


4.824 


-3.726 


2153 


ARG284 


CG 


-7.99 


4.956 


-4.798 


2154 


ARG284 


CD 


-7.417 


4.674 


-6.18 


2155 


ARG284 


NE 


-6.879 


3.305 


-6.258 


2156 


ARG284 


CZ 


-7.603 


2.231 


-6.583 


2157 


ARG284 


NH1 


-8.911 


2.35 


-6.816 


2158 


ARG284 


NH2 


-7.021 


1.032 


-6.658 



-95- 



WO 2004/061116 



PCT/US2003/034082 



2159 


ILE285 


N 


-7.588 


2 233 


-P ilOO 


2160 


ILE285 


CA 


-8.284 


0 Q4.2 


-^.0 iy 


2161 


ILE285 


c 


-8.742 


o ^7 

0.057/ 


-1 iftR 

- 1. TOO 


2162 


ILE285 


o 


-9.703 




-1 f\Q7 


2163 


ILE285 


CB 


-7.36 




_o OA7 
"0.^4/ 


2164 


ILE285 


CG1 


-8.033 


-1 ^RR 


O 

-o.4yy 


2165 


ILE285 


CG2 


-6 045 


-O P^R 


_0 CA1 


2166 


ILE285 


CD1 


-7 11Q 


-P *^P7 


-A 070 

-4.^/^1 


2167 


PHE286 


N 


^J. I OO 


n qo4 
u.yoH- 


A noo 


2168 


PHE286 


CA 


-o.o*ro 


U.D 




2169 


PHP286 




-R 4m 

"O.HO I 




O OTP 

2.27 


2170 


PHPPftfi 

l 1 XI ,r. KJKJ 


n 


J7 KAA 
- / .OHH 


l.DD 


3.115 


2171 


PHE286 


OB 


-7 07^ 


-u.oy4 


1.708 


2172 


PHPPftfi 






- i.y4o 


4 ooe 

1.025 


2173 

Cm 1 / O 


PHPPftfi 


OU 1 


-ft ft7R 


O RAO 


o ooo 

0.323 


Cm 1 / t 


PHPPftfi 
rnccOD 






O COO 


1.117 


Cm 1 / \J 


PHPPftfi 






-0.736 


-0.298 


2176 


PHPPftfi 


PPP 


-y.uy 


-O.OD0 


0.495 


exit 


PHPPftfi 




Q AO 


-4.41 7 


-0.213 


21 7R 

C X I o 


THRPR7 


M 
l>l 


-y.o/ / 


2.584 


2.193 


21 7Q 


THRPR7 
i nntO/ 


Vmtf\ 


A /HO 


o.ooo 


3.212 




THRPR7 


L» 


-y.yoi 


3.053 


4.519 


P1R1 


' THRPR7 




-1 0.o43 


2.216 


4.543 


P1RP 


1 nrliioY 


Ud 


-i r\ oo a 

-1 0.294 


4.801 


2.77 


P1RR 
^ 1 oo 






-i 0.207 


5.833 


3.745 


P1R4 

^ 1 0*T 


TWRPR7 




- 1 1 .759 


4.423 


2.643 


P1R5 


II POOR 


Kl 
IM 


A OAC 

-y.oOo 


o >l co 

3.459 


5.609 


P1Rfi 


11 COQQ 
1L.EZj£00 


OA 
OA 


A 7AIZ 


2.966 


O 0-4 4 

6.911 


2187 


II PPftft 




-i o nee 


3.747 


^ OO M 

7.391 


2188 


II PPRR 




-J J .bob 


o h err 
3. IO0 


O OH A 

8.014 


2189 


II PPRR 


PR 


-O.Oo4 


O.U44 


7.894 


2190 


ILPPRR 


Pft1 


-o.U4 


A ACZQ 

4.4oy 


O A4 0 


2191 


II PPRR 


HAP 


-7 /IRQ 




7.46 1 


2192 


II PPRR 


cni 


-ft QOQ 


>l KA7 
4.0U/ 


Q ACO 


2193 


ALA289 


in 


-11 11ft 

- 1 I.I ID 


A Oft*? 


ft QQ 

o.oy 


2194 


ALA289 


CA 


-1P 


R 7AO 


V AO 


2195 


ALA289 




- IO.OI o 


o.o/ 




2196 


ALA289 


o 


- 1 w.HOO 


ft ftftp 


A QftA 
4.yo4 


2197 


ALAP89 


CB 


-1 p nrn 


7 PPR 


7 ni o 


2198 


GLU290 


N 




d P^Q 


ft 17A 
0. I / 4 


2199 


GLU290 


CA 


-1 4 7QR 
1 *+•/ oo 


RRP 


R 1PR 


2200 


GLU290 


c 




il 17Q 


^ 17R 


2201 


GLU290 


o 


-16 891 


4 234 


A 1^R 
*t. lOo 


2202 


GLU290 


CB 


-1 4 766 


2 14 


0.04 


2203 


GLU290 


CG 


-15 669 


1 37*5 


4 Q,R1 
t.oo 1 


2204 


GLU290 


CD 


-15.736 


-0 091 


4 RHP 


2205 


GLU290 


OE1 


-16.751 


-0.475 


5 367 


2206 


GLU290 


OE2 


-14.809 


-0.819 


4.472 


2207 


THR291 


N 


-16.659 


4.619 


6.347 


2208 


THR291 


CA 


-17.992 


5.216 


6.481 


2209 


THR291 


C 


-17.983 


6.47 


7.349 


2210 


THR291 


O 


-17.68 


6.437 


8.55 


2211 


THR291 


CB 


-18.978 


4.219 


7.094 


2212 


THR291 


OG1 


-18.522. 


3.832 


8.382 



-96- 



WO 2004/061116 



PCTAJS2003/034082 



2213 


THR291 


CG2 


-19.168 


2.961 


6.257 


2214 


ALA292 


N 


-18.397 


7.562 


6.734 


2215 


ALA292 


CA 


-18.676 


8.799 


7.471 


2216 


ALA292 


C 


-20.103 


8.704 


8.001 


2217 


ALA292 


O 


-21.051 


9.169 


7.355 


2218 


ALA292 


CB 


-18.552 


9.986 


6.521 


2219 


THR293 


N 


-20.231 


8.1 


9.172 


2220 


THR293 


CA 


-21.535 


7.709 


9.721 


2221 


THR293 


C 


-22.161 


8.802 


10.585 




THR293 


O 


-22.152 


8.733 


11.82 


2223 


THR293 


CB 


-21.308 


6.447 


10.546 


2224 


THR293 


OG1 


-20.564 


5.523 


9.756 


2225 


THR293 


CG2 


-22.617 


5.788 


10.962 


2226 


SER294 


N 


-22.703 


9.805 


9.913 


2227 


SER294 


CA 


-23.293 


10.956 


10.601 


2228 


SER294 


C 


-24.198 


11.766 


9.675 


2229 


SER294 


o 


-25 386 


11.443 


9.529 


2230 


SER294 


CB 


-22.167 


11.822 


11.164 


2231 


SER294 


OG 


-20.976 


11.559 


10.426 


2232 


ARG295 


N 


-23.598 


12.789 


9.076 


2233 


ARG295 


CA 


-24.233 


13.782 


8.187 


2234 


ARG295 


C 


-25.738 


13.903 


8.366 


2235 


ARG295 


o 


-26.524 


13.3 


7.627 


2236 


ARG295 


CB 


-23.905 


13.424 


6.747 


2237 


ARG295 


CG 


-22.4 


13.455 


6.518 


2238 


ARG295 


CD 


-21.819 


14.841 


6.786 


2239 


ARG295 


NE 


-20.362 


14.859 


6.577 


2240 


ARG295 


CZ 


-19.475 


14.679 


7.56 


2241 


ARG295 


NH1 


-19.892 


14.437 


8.805 


2242 


ARG295 


NH2 


-18.167 


14.719 


7.293 


2243 


PHE296 


N 


-26.119 


14.646 


9.387 


2244 


PHE296 


CA 


-27.528 


14.825 


9.712 


2245 


PHE296 


c 


-28.176 


15.754 


8.701 


2246 


PHE296 


O 


-27.736 


16.899 


8.529 


2247 


PHE296 


CB 


-27.623 


15.428 


11.11 


2248 


PHE296 


CG 


-29.052 


15.695 


11.562 


2249 


PHE296 


CD1 


-29.564 


16.985 


11.533 


2250 


PHE296 


CD2 


-29.843 


14.643 


12.002 


2251 


PHE296 


CE1 


-30.871 


17.221 


11.936 


2252 


PHE296 


CE2 


-31.149 


14.879 


12.406 


2253 


PHE296 


CZ 


-31.663 


16.167 


12.37 


2254 


ALA297 


N 


-29.2 


15.255 


8.032 


2255 


ALA297 


CA 


-29.931 


16.073 


7.065 


2256 


ALA297 


C 


-30.783 


17.111 


7.781 


2257 


ALA297 


0 


-31.769 


16.781 


8.45 


2258 


ALA297 


CB 


-30.819 


15.165 


6.224 


.2259 


THR298 


N 


-30.369 


18.361 


7.668 


2260 


THR298 


CA 


-31.129 


19.47 


8.255 


2261 


THR298 


C 


-32.139 


20.005 


7.246 


2262 


THR298 


O 


-33.087 


20.712 


7.603 


2263 


THR298 


CB 


-30.17 


20.586 


8.659 


2264 


THR298 


OG1 


-29.565 


21.12 


7.488 


2265 


THR298 


CG2 


-29.068 


20.082 


9.582 


2266 


ALA299 


N 


-31.926 


19.649 


5.992 



-97- 



WO 2004/061116 



2267 


ALA299 


CA 


2268 


ALA299 


C 


2269 


ALA299 


0 


2270 


ALA299 


CB 


2271 


ASP300 


N 


2272 


ASP300 


CA 


2273 


ASP300 


C 


2274 


ASP300 


0 


2275 


ASP300 


CB 


2276 


ASP300 


CG 


2277 


ASP300 


OD1 


2278 


ASP300 


OD2 


2279 


VAL301 


N 


2280 


VAL301 


CA 


2281 


VAL301 


C 


2282 


VAL301 


o 


2283 


VAL301 


CB 


2284 


VAL301 


CG1 


2285 


VAL301 


CG2 


2286 


GLU302 


N 


2287 


GLU302 


CA 


2288 


GLU302 


c 


2289 


GLU302 


o 


2290 


GLU302 


CB 


2291 


GLU302 


CG 


2292 


GLU302 


CD 


2293 


GLU302 


OE1 


2294 


GLU302 


OE2 


2295 


ILE303 


N 


2296 


ILE303 


CA 


2297 


ILE303 


c 


2298 


ILE303 


O 


2299 


ILE303 


CB 


2300 


ILE303 


CG1 


2301 


ILE303 


CG2 


2302 


ILE303 


CD1 


2303 


GLY304 


N 


2304 


GLY304 


CA 


2305 


GLY304 


C 


2306 


GLY304 


0 


2307 


GLY305 


N 


2308 


GLY305 


CA 


2309 


GLY305 


C 


2310 


GLY305 


0 


2311 


THR306 


N 


2312 


THR306 


CA 


2313 


THR306 


C 


2314 


THR306 


O 


2315 


THR306 


CB 


2316 


THR306 


OG1 


2317 


THR306 


CG2 


2318 


LEU307 


N 


2319 


LEU307 


CA 


2320 


LEU307 


C 



PCT/US2003/034082 



-32.881 


19.968 


4.932 


-33.091 


18.734 


4.068 


-32.427 


17.712 


4.27 


-32.331 


21.11 


4.086 


-34.041 


18.817 


3.151 


-34.256 


17.727 


2.191 


-33.253 


17.852 


1.048 


-33.498 


18.545 


0.053 


-35.679 


17.801 


1.648 


-36.682 


17.714 


2.793 


-36.686 


16.69 


3.463 


-37.289 


18.733 


3.09 


-32.128 


17.177 


1.199 


-31.029 


17.337 


0.244 


-31.065 


16.261 


-0.833 


-30.956 


15.062 


-0.55 


-29.71 1 


17.263 


1.007 


-28.543 


17.673 


0.118 


-29.756 


18.154 


2.24 


-31.242 


16.695 


-2.067 


-31.23 


15.75 


-3.184 


-29.799 


15.413 


-3.603 


-29.141 


16.178 


-4.318 


-31.98 


16.355 


-4.363 


-32.053 


15.344 


-5.497 


-32.662 


15.959 


-6.75 


-33.484 


16.852 


-6.606 


-32.169 


15.628 


-7.82 


-29.365 


14.227 


-3.222 


-28.014 


13.766 


-3.539 


-28.053 


12.812 


-4.726 


-28.452 


11.647 


-4.6 


-27.438 


13.076 


-2.307 


-27.329 


14.056 


-1.148 


-26.066 


12.49 


-2.608 


-26.305 


15.146 


-1.441 


-27.715 


13.356 


-5.885 


-27.705 


12.592 


-7.139 


-29.056 


11.935 


-7.402 


-29.187 


10.708 


-7.311 


-30.079 


12.757 


-7.573 


-31.432 


12.229 


-7.805 


-32.226 


11.982 


-6.517 


-33.289 


12.584 


-6.314 


-31.694 


11.132 


-5.653 


-32.399 


10.717 


-4.431 


-32.569 


11.859 


-3.43 


-31.597 


12.497 


-3.015 


-31.591 


9.598 


-3.782 


-31.461 


8.534 


-4.716 


-32.278 


9.051 


-2.537 


-33.811 


12.127 


-3.066 


-34.093 


13.162 


-2.068 


-33.971 


12.6 


-0.652 



-98- 



WO 2004/061116 



PCT/US2003/034082 



2321 


LEU307 


0 


-34.803 


11.799 


-0.209 


2322 


LEU307 


CB 


-35.513 


13.67 


-2.295 


2323 


LEU307 


CG 


-35.845 


14.852 


-1.391 


2324 


LEU307 


CD1 


-34.946 


16.039 


-1.71 


2325 


LEU307 


CD2 


-37.31 


15.248 


-1.534 


2326 


ILE308 


N 


-32.93 


13.021 


0.045 


2327 


ILE308 


CA 


-32.738 


12.616 


1.439 


2328 


ILE308 


C 


-33.586 


13.486 


2.362 


2329 


ILE308 


O 


-33.481 


14.719 


2.356 


2330 


ILE308 


CB 


-31.253 


12.742 


1.764 


2331 


ILE308 


CG1 


-30.456 


11.77 


0.901 


2332 


ILE308 


CG2 


-30.976 


12.507 


3.245 


2333 


ILE308 


CD1 


-28.974 


11.794 


1.246 


2334 


ARG309 


N 


-34.466 


12.834 


3.101 


2335 


ARG309 


CA 


-35.386 


13.543 


3.993 


2336 


ARG309 


c 


-34.674 


14.125 


5.209 


2337 


ARG309 


o 


-33.807 


13.486 


5.824 


2338 


ARG309 


CB 


-36.447 


12.563 


4.474 


2339 


ARG309 


CG 


-37.117 


11.841 


3.312 


2340 


ARG309 


CD 


-38.079 


10.778 


3.83 


2341 


ARG309 


NE 


-38.721 


10.055 


2.721 


2342 


ARG309 


CZ 


-40.046 


9.951 


2.594 


2343 


ARG309 


NH1 


-40.85 


10.528 


3.49 


2344 


ARG309 


NH2 


-40.566 


9.278 


1.566 


2345 


ALA310 


N 


-35.099 


15.325 


5.566 


2346 


ALA310 


CA 


-34.597 


15.999 


6.763 


2347 


ALA310 


c 


-34.952 


15.189 


8.003 


2348 


ALA310 


0 


-36.005 


14.542 


8.07 


2349 


ALA310 


CB 


-35.223 


17.386 


6.853 


2350 


GLY311 


N 


-34 


15.115 


8.913 


2351 


GLY311 


CA 


-34.172 


14.325 


10.13 


2352 


GLY311 


C 


-33.475 


12.968 


10.042 


2353 


GLY311 


O 


-33.472 


12.217 


11.023 


2354 


GLU312 


N 


-32.982 


12.611 


8.866 


2355 


GLU312 


CA 


-32.294 


11.322 


8.719 


2356 


GLU312 


C 


-30.776 


11.471 


8.737 


2357 


GLU312 


O 


-30.236 


12.577 


8.599 


2358 


GLU312 


CB 


-32.743 


10.651 


7.429 


2359 


GLU312 


CG 


-34.243 


10.386 


7.453 


2360 


GLU312 


CD 


-34.639 


9.537 


6.252 


2361 


GLU312 


OE1 


-34.686 


10.083 


5.156 


2362 


GLU312 


OE2 


-34.723 


8.33 


6.419 


2363 


GLY313 


N 


-30.107 


10.356 


8.981 


2364 


GLY313 


CA 


-28.64 


10.335 


8.994 


2365 


GLY313 


C 


-28.079 


9.813 


7.673 


2366 


GLY313 


O 


-28.674 


8.944 


7.02 


2367 


VAL314 


N 


-26.996 


10.429 


7.237 


2368 


VAL314 


CA 


-26.33 


10.028 


5.994 


2369 


VAL314 


C 


-24.93 


9.467 


6.267 


2370 


VAL314 


O 


-24.127 


10.02 


7.033 


2371 


VAL314 


CB 


-26.28 


11.246 


5.073 


2372 


VAL314 


CG1 


-25.639 


10.942 


3.724 


2373 


VAL314 


CG2 


-27.681 


11.804 


4.861 


2374 


VAL315 


N 


-24.674 


8.328 


5.649 



-99- 



WO 2004/061116 



PCT/US2003/034082 



2375 


VAL315 


CA 


co.oou 


7 64fi 


C 77Q 
Z>.( I O 


2376 


VAL315 


c 


-PP 647 




A /lOft 
*f.'fOO 


2377 


VAL315 


o 


-p*. ftp*? 


6 Q7^ 


Q /17Q 

o.H/y 


2378 


VAL315 


CB 


-P^ fi°,4 


fi P07 


ft 01 O 

o.^lo 


2379 


VAL315 


CG1 


-PP 3P 1 ? 


^ 437 


ft Q/17 


2380 


VAL315 


CG2 




ft 1ft^ 
O. IOO 


7 COO 


2381 


GLY316 


N 


~t 1 .o 


ft APC7 
O.HO/ 


A *Y7A 
4.0/4 


2382 


GLY316 


CA 


-PO 7AQ 

~cL\J. / 057 


ft C\^Q 

o.ooy 


O 1 CO 

o.lo2 


2383 


GLY316 


C 


-1 Q 7Q7 


7 ojio 

/ .OO^ 


o.Uyo 


2384 


GLY316 


o 


lo.yuo 


7 PA 


o a a a 
o.y4o 


2385 


LFU317 


N 


10 Q7A 

-iy.y/o 


ft RAO 
O.OUo 


O "i A4 

2.121 


2386 


LPLT317 


HA 


10 1AO 

-iy.iu^ 


0.O4o 


1.978 


2387 


LFU^17 




-1 / .o41 


D./22 


1.218 


2388 


I FU^17 

1— L- WO 1 I 


n 


17 7CQ 
-1 / ./DO 


o.4oo 


0.009 




1 PI 1<*17 


PR 


■4 0 OOC 

-1 y.ooo 


4.20O 


1.199 






p/^ 

ova 


OA O^O 

-2U,ol2 


o API A 

O.094 


2.064 


P^QI 

CO<7 I 


1 PI IQ17 


PHI 
OUT 


1 A 4 

-19.129 


2.374 


2.697 


P3QP 


I PI I^IT 


OU*l 


-21 .328 


3.519 


3.121 


OOQO 
cOJJO 


QPROlo 
ocnolo 




-1 6.794 


6.056 


1.951 


P^Qd. 

4&Oi7*T 


ocnolo 


OA 


H COO 

-15.563 


6.559 


1.34 




ocnolo 


p 
O 


-1 4.789 


5.483 


0.588 


P'Vift 

<£0570 


ocnolo 




-1 4.286 


5.777 


-0.503 


toy / 


QCD01 p 

Ocrlolo 


ot> 


-14.689 


7.158 


2.434 




ocnolo 




-1 3.375 


7.32 


1.915 




ACM01O 

Aorvioiy 


M 

N 


-14.954 


4.222 


0.954 




MOlViOl y 


pa 


-14.267 


3.196 


0.162 




MoiNoiy 


p 
O 


-15.089 


2.774 


-1 .058 




MOIMO iy 


r\ 

\J 


•4 >f ins 

-1 4.498 


2.343 


-2.052 




AON1Q1Q 
MOINO 1 y 


OD 


-13.899 


1.982 


1.004 


2404 


MOIMO 1 


pfi 
ova 


■4 O E01 


1.507 


0.516 


2405 


MOIMO 1 57 


oni 

UUl 


H i COO 

-11 .o2o 


O OOO 

2.000 


0.321 


2406 


/AOIMO 1 57 


MHO 


1 O Q7D 

-12.0/0 


A orvo 
0.209 


0.332 


2407 


ai A°,pn 


M 


-lo.oo^ 


O 1 OA 

o.l 39 


-1 .091 


2408 


AI A^PO 


pA 
OA 


17 1Q1 

-1 /.lol 


O QQQ 

2.oo9 


-2.283 


2409 


Al A^Pft 


p 
O 


17 ATM 
-1 / .UUl 


4.UOO 


O Oil o 

-3.248 


2410 


ALA3P0 




1ft AQ7 

-lo.oy/ 


o QCO 
O.OOD 


A A a A 

-4.4o4 


2411 


ALA3PO 


HR 


1p ft40 


O 7fJ 


-1.875 


2412 


GLY321 


N 


-1ft ftA 


1Q7 


O ftftft 
-il.OOO 


2413 


GLY321 


CA 


- I o.^oo 


ft Oft 

o.oo 


o Aoa 


2414 


GLY321 


c 


-1 A QAO 


ft HdR 


~A 17 


2415 


GLY321 


o 


-1 A QOA 


ft inft 

O. I uo 


C ^A>( 
-D.4U4 


2416 


ASN322 


N 


1 0.57s?0 


^ <%nft 

O.OUO 


O vlftft 

-0.400 


2417 


ASN322 


CA 


-1 P ftft^ 


S 17P 

O. 1 / cL 


,4 110 
-4. I lo 


2418 


ASN322 


c 


-1P 76*5 


3 QP4 


-R AAR 

-o.uuo 


2419 


ASN322 


o 


-11 QS4 


^ 7ftQ 


_C 007 


2420 


ASN322 


CB 


-11 651 




o noc 
-O.UOO 


2421 


ASN322 


CG 


-11.54 


6.013 


-1 QQ4 

1 .1717*+ 


2422 


ASN322 


OD1 


-11.701 


7.211 


-2.278 


2423 


ASN322 


ND2 


-11.095 


5.595 


-0.824 


2424 


HIS323 


N 


-13.785 


3.103 


-4.807 


2425 


HIS323 


CA 


-14.068 


1.985 


-5.716 


2426 


HIS323 


C 


-15.037 


2.337 


-6.846 


2427 


HIS323 


O 


-15.541 


1.435 


-7.525 


2428 


HIS323 


CB 


-14.625 


0.804 


-4.939 



-100- 



WO 2004/061116 



PCTAJS2003/034082 



2429 


HIS323 


CG 


-13,581 


0.008 


-4.19 


2430 


HIS323 


ND1 


-13.802 


-0.807 


-3.144 


2431 


HIS323 


CD2 


-12.233 


-0.04 


-4.46 


2432 


HIS323 


CE1 


-12.634 


-1.351 


-2.748 


2433 


HIS323 


NE2 


-11.664 


-0.875 


-3.563 


2434 


ASP324 


N 


-15.353 


3.608 


-7.007 


2435 


ASP324 


CA 


-16.209 


4.041 


-8.109 


2436 


ASP324 


c 


-15.409 


4.076 


-9.411 


2437 


ASP324 


o 


-14.628 


5.012 


-9.642 


2438 


ASP324 


CB 


-16.72 


5.431 


-7.737 


2439 


ASP324 


CG 


-17.711 


6.002 


-8 74 


2440 


ASP324 


OD1 


-18.738 


6.476 


-8.275 


2441 


ASP324 


OD2 


-17.28 


6.257 


-9 859 


2442 


PR0325 


N 


-15.831 


3 237 


-10 34Q 


2443 


PR0325 


CA 


-15.082 


2 Q8Q 


-11 5Q5 


2444 


PR0325 


c 


-15 201 


4 OQQ 


-12 64ft 


2445 


PR0325 


o 


-14 504 




-1 °. fifi7 


2446 


PR0325 


CB 


-15 653 


1 713 


-12 1*34 


2447 


PR0325 


CG 


-16 Q 


1 346 


-11 °.>1(C 
- 1 1 .OHO 


2448 


PR0325 


CD 


-17 01 


2 374 


-10 2°.2 


2449 


ASP326 


N 


-1 ft OOft 


5 11 


-1 2 °.61 


2450 


A^P326 


CA 


lu.cUO 






2451 


ASP326 

nor ocu 




-1*1 1Afi 

I«J. t*T\J 


7 °.12 


-12 QAQ 


2452 


ASP326 

nor \JC-\J 


o 


-1 d Q2d 


ft 21 ft 


-1 °. 7Q7 


2453 


ASP326 


CB 


-17 5Q7 


6 ftPfi 


-1 2 QCM. 


2454 


ASP326 

nor \jckj 




-1ft 675 


D. / vlO 


.10 1CC 
I O. IOO 


2455 


ASP326 


OD1 


-1Q 125 


5 571 
j.jf i 


-14. 277 


2456 


ASP326 


0D2 


-1 ft Q63 


5 Oftft 


-12 17Q 


2457 


GLY327 


(S{ 


-14 46ft 


7 162 


-11 ft57 
— I I .ou/ 


2458 


GLY327 


CA 


-13 318 


ft 005 


-11 542 


2459 


GLY327 


c 


-12.061 


7 145 


-11 5Q4 


2460 


GLY327 


o 


-11.166 


7.366 


-12 41ft 


2461 


PHE328 


N 


-12.046 


6 12 


-10 76 


2462 


PHE328 


CA 


-10.893 


5.218 


-10 705 


2463 


PHE328 


c 


-11.119 


4.009 


-11 .602 


2464 


PHE328 


o 


-11.92 


3.114 


-11.3 


2465 


PHE328 


CB 


-10.653 


4.81 


-9.257 


2466 


PHE328 


CG 


-10.261 


5.995 


-8.376 


2467 


PHE328 


CD1 


-9.098 


6.702 


-8.651 


2468 


PHE328 


CD2 


-11.07 


6.379 


-7.314 


2469 


PHE328 


CE1 


-8.737 


7.784 


-7.858 


2470 


PHE328 


CE2 


-10.709 


7.459 


-6.519 


2471 


PHE328 


CZ 


-9.542 


8.161 


-6.791 


2472 


GLU329 


N 


-10.377 


4.006 


-12.696 


2473 


GLU329 


CA 


-10.511 


2.993 


-13.752 


2474 


GLU329 


C 


-10.048 


1.63 


-13.253 


2475 


GLU329 


O 


-8.847 


1.399 


-13.092 


2476 


GLU329 


CB 


-9.673 


3.409 


-14.962 


2477 


GLU329 


CG 


-10.104 


4.752 


-15.557 


2478 


GLU329 


CD 


-9.106 


5.864 


-15.215 


2479 


GLU329 


OE1 


-8.54 


5.804 


-14.132 


2480 


GLU329 


OE2 


-8.904 


6.724 


-16.059 


2481 


ASN330 


N 


-10.987 


0.698 


-13.249 


2482 


ASN330 


CA 


-10.826 


-0.597 


-12.566 



-101- 



WO 2004/061116 



PCT/US2003/034082 



2483 


ASN330 


C 


-10.252 


-0.375 


-11.171 


2484 


ASN330 


O 


-9.065 


-0.629 


-10.929 


2485 


ASN330 


CB 


-9.925 


-1.547 


-13.355 


2486 


ASN330 


CG 


-9.943 


-2.959 


-12.75 


2487 


ASN330 


OD1 


-9.958 


-3.151 


-11.523 


2488 


ASN330 


ND2 


-9.893 


-3.939 


-13.632 


2489 


PR0331 


N 


-11.157 


-0.201 


-10.224 


2490 


PR0331 


CA 


-10.787 


0.129 


-8.843 


2491 


PR0331 


c 


-10.287 


-1.061 


-8.012 


2492 


PR0331 


O 


-9.994 


-0.904 


-6.822 


2493 


PR0331 


CB 


-12.044 


0.673 


-8 25 


2494 


PR0331 


CG 


-13.21 


0.356 


-9 172 


2495 


PR0331 


CD 


-12.608 


-0.26 


-10 416 


2496 


ASP332 


N 


-10.198 


-2.236 


-8 81^ 


2497 


ASP332 


CA 


-9.742 


-3 417 


-7 88Q 


2498 


ASP332 


c 


-8.267 


-3 667 


-8 17Q 


2499 


ASP332 


o 


-7 609 


-4 447 


-7 47ft 


2500 


ASP332 


CB 


-10.557 


-4 617 


-8 *V?ft 


2501 


ASP332 


CG 


-12 048 


-4 3^6 


-8 171 


2502 


ASP332 


OD1 


-1P 43 


-3 Q61 

0.570 1 


-7 Oft 


2503 


ASP332 


OD2 


-1 P 784 


-4 RfiK 


-Q 197 


2504 


THR333 


N 


-7 744. 


-P Q7 


_Q -170 


2505 


THR333 

1111 IV,-* w 


CA 




-o. i >?o 


-y.000 


2506 


THR333 


c 


433 


-P PP4 


-ft 7i5 
0. / 00 


2507 


THR333 


O 


-5 P83 


-1 04P 




2508 


THR333 


CB 


-6 18Q 


-P 877 


-11 OPft 


2509 


THR333 


OG1 


-7 07P 


-3 748 


-11 71ft 


2510 


THR333 


CG2 


-A 7RQ 


-3 147 


-11 K'id 


2511 


PHE334 


N 


-4 84 


-P 77P 


-7 fiftft 


2512 


PHE334 


CA 


-3 868 


-2 017 


-8 8Q 


2513 


PHE334 


c 


-2.589 


-1 762 


-7 684 


2514 


PHE334 


o 


-1.817 


-2.686 


-7 Q7Q 


2515 


PHE334 


CB 


-3.543 


-2.812 


61 


2516 


PHE334 


CG 


-2.485 


-2.162 


-4 744 


2517 


PHE334 


CD1 


-2.784 




-4.047 


2518 


PHE334 


CD2 


-1.221 


-2.728 


-4.638 


2519 


PHE334 


CE1 


-1 .82 


-0.404 


-3.246 


2520 


PHE334 


CE2 


-0.256 


-2.132 


-3.837 


2521 


PHE334 


CZ 


-0.556 


-0.969 


-3.142 


2522 


ASP335 


N 


-2.407 


-0.509 


-8.064 


2523 


ASP335 


CA 


-1.206 


-0.106 


-8.794 


2524 


ASP335 


C 


-0.697 


1.24 


-8.283 


2525 


ASP335 


O 


-1.322 


2.277 


-8.525 


2526 


ASP335 


CB 


-1.55 


-0.019 


-10.277 


2527 


ASP335 


CG 


-0.276 


0.2 


-11.084 


2528 


ASP335 


OD1 


0.34 


-0.792 


-11.448 


2529 


ASP335 


OD2 


0.15 


1.347 


-11.159 


2530 


ILE336 


N 


0.532 


1.249 


-7.794 


2531 


ILE336 


CA 


1.118 


2.426 


-7.127 


2532 


ILE336 


C 


1.596 


3.552 


-8.058 


2533 


ILE336 


O 


2.207 


4.512 


-7.579 


2534 


ILE336 


CB 


2.303 


1.948 


-6.301 


2535 


ILE336 


CG1 


3.437 


1.48 


-7.203 


2536 


ILE336 


CG2 


1.873 


0.817 


-5.375 



-102- 



WO 2004/061116 



PCT/US2003/034082 



2537 


ILE336 


CD1 


4.676 


1.131 


-6.396 


2538 


GLU337 


N 


1.41 


3.407 


-9.361 


2539 


GLU337 


CA 


1.712 


4.496 


-10.291 


2540 


GLU337 


C 


0.434 


5.236 


-10.681 


2541 


GLU337 


0 


0.487 


6.246 


-11.393 


2542 


GLU337 


CB 


2.393 


3.936 


-11.533 


2543 


GLU337 


CG 


3.788 


3.412 


-11.213 


2544 


GLU337 


CD 


4.442 


2.871 


-12.481 


2545 


GLU337 


0E1 


3.934 


3.167 


-13.552 


2546 


GLU337 


0E2 


5.372 


2.089 


-12.348 


2547 


ARG338 


N 


-0.697 


4.719 


-10.229 


2548 


ARG338 


CA 


-1.982 


5.376 


-10.464 


2549 


ARG338 


C 


-2.095 


6.636 


-9.615 


2550 


ARG338 


0 


-1.809 


6.625 


-8.412 


2551 


ARG338 


CB 


-3.085 


4.393 


-10.082 


2552 


ARG338 


CG 


-4.484 


4.977 


-10 234 


2553 


ARG338 


CD 


-5.532 


3.961 




2554 


ARG338 


NE 


-5.375 


2.729 


-10.591 


2555 


ARG338 


CZ 


-6.357 


1.847 


-10 77 


2556 


ARG338 


NH1 


-7.549 


2.056 


-in P07 


2557 


ARG338 


NH2 


-6.143 


0.757 


-11 509 


2558 


GLY339 


N 


-2.423 


7.735 


-10 271 


2559 


GLY339 


CA 


-2.685 


6.98 




2560 


GLY339 


c 


-4.038 


8.895 


-ft RS? 


2561 


GLY339 


0 


-5.086 


9.045 


-9 489 


2562 


ALA340 


N 


-3.994 


8.767 


-7 534 


2563 


ALA340 


CA 


-5.202 


8.646 


-6 691 


2564 


ALA340 


C 


-5.871 


9.982 


-6.335 


2565 


ALA340 


O 


-6.419 


10.145 


-5.237 


2566 


ALA340 


CB 


-4.817 


7.907 


-5.414 


2567 


ARG341 


N 


-5.844 


10.917 


-7.27 


2568 


ARG341 


CA 


-6.395 


12.25 


-7.043 


2569 


ARG341 


C 


-7.904 


12.179 


-6.87 


2570 


ARG341 


0 


-8.571 


11.307 


-7.436 


2571 


ARG341 


CB 


-6.045 


13.131 


-8.235 


2572 


ARG341 


CG 


-4.535 


13.264 


-8.387 


2573 


ARG341 


CD 


-4.175 


14.135 


-9.583 


2574 


ARG341 


NE 


-2.718 


14.303 


-9.691 


2575 


ARG341 


CZ 


-2.142 


15.463 


-10.016 


2576 


ARG341 


NH1 


-2.898 


16.534 


-10.271 


2577 


ARG341 


NH2 


-0.813 


15.551 


-10.092 


2578 


HIS342 


N 


-8.378 


13.009 


-5.954 


2579 


HIS342 


CA 


-9.801 


13.12 


-5.603 


2580 


HIS342 


C 


-10.342 


11.888 


-4.88 


2581 


HIS342 


0 


-11.547 


11.621 


-4.961 


2582 


HIS342 


CB 


-10.644 


13.367 


-6.855 


2583 


HIS342 


CG 


-10.32 


14.641 


-7.606 


2584 


HIS342 


ND1 


-10.757 


15.877 


-7.303 


2585 


HIS342 


CD2 


-9.536 


14.758 


-8.731 


2586 


HIS342 


CE1 


-10.259 


16.758 


-8.195 


2587 


HIS342 


NE2 


-9.504 


16.064 


-9.078 


2588 


HIS343 


N 


-9.49 


11.15 


-4.184 


2589 


HIS343 


CA 


-10.02 


10.075 


-3.346 


2590 


HIS343 


C 


-10.572 


10.661 


-2.053 



-103- 



WO 2004/061116 



PCT/US2003/034082 



2591 


HIS343 


O 


-9.984 


11.572 


-1.455 


2592 


HIS343 


CB 


-8.988 


8.965 


-3.085 


2593 


HIS343 


CG 


-7.691 


9.253 


-2.333 


2594 


HIS343 


ND1 


-7.374 


10.313 


-1.562 


2595 


H1S343 


CD2 


-6.597 


8.421 


-2.319 


2596 


HIS343 


CE1 


-6.125 


10.166 


-1.079 


2597 


HIS343 


NE2 


-5.643 


8.993 


-1.549 


2598 


VAL344 


N 


-11.634 


10.052 


-1.563 


2599 


VAL344 


CA 


-12.278 


10.521 


-0.331 


2600 


VAL344 


C 


-11.732 


9.82 


0.915 


2601 


VAL344 


O 


-12.271 


9.998 


2.015 


2602 


VAL344 


CB 


-13.787 


10.343 


-0.45 


2603 


VAL344 


CG1 


-14.412 


11.457 


-1.282 


2604 


VAL344 


CG2 


-14.141 


8.977 


-1.019 


2605 


ALA345 


N 


-10.581 


9.178 


0.768 


2606 


ALA345 


CA 


-9.947 


8.411 


1.849 


2607 


ALA345 


c 


-9.419 


9.24 


3.02 


2608 


ALA345 


o 


-9.242 


8.703 


4.12 


2609 


ALA345 


CB 


-8.757 


7.686 


1.237 


2610 


PHE346 


N 


-9.169 


10.518 


2.787 


2611 


PHE346 


CA 


-8.763 


11.424 


3.867 


2612 


PHE346 


c 


-9.885 


12.363 


4.291 


2613 


PHE346 


o 


-9.652 


13.29 


5.083 


2614 


PHE346 


CB 


-7.571 


12.253 


3.409 


2615 


PHE346 


CG 


-6.223 


11.583 


3.635 


2616 


PHE346 


CD1 


-5.295 


11.513 


2.605 


2617 


PHE346 


CD2 


-5.921 


11.048 


4.881 


2618 


PHE346 


CE1 


-4.061 


10.916 


2.825 


2619 


PHE346 


CE2 


-4.687 


10.451 


5.101 


2620 


PHE346 


CZ 


-3.756 


10.388 


4.073 


2621 


GLY347 


N 


-11.068 


12.169 


3.731 


2622 


GLY347 


CA 


-12.177 


13.091 


3.979 


2623 


GLY347 


c 


-12.007 


14.367 


3.158 


2624 


GLY347 


o 


-10.882 


14.764 


2.825 


2625 


PHE348 


N 


-13.123 


14.988 


2.822 


2626 


PHE348 


CA 


-13.084 


16.264 


2.097 


2627 


PHE348 


C 


-13.98 


17.307 


2.752 


2628 


PHE348 


O 


-14.185 


17.307 


3.976 


2629 


PHE348 


CB 


-13.497 


16.085 


0.64 


2630 


PHE348 


CG 


-12.375 


15.613 


-0.285 


2631 


PHE348 


CD1 


-12.654 


14.749 


-1.333 


2632 


PHE348 


CD2 


-11.075 


16.06 


-0.081 


2633 


PHE348 


CE1 


-11.633 


14.322 


-2.172 


2634 


PHE348 


CE2 


-10.054 


15.633 


-0.918 


2635 


PHE348 


CZ 


-10.332 


14.763 


-1.963 


2636 


GLY349 


N 


-14.438 


18.227 


1.918 


2637 


• GLY349 


CA 


-15.269 


19.339 


2.376 


2638 


GLY349 


C 


-14.484 


20.194 


3.353 


2639 


GLY349 


0 


-13.286 


20.433 


3.165 


2640 


VAL350 


N 


-15.121 


20.525 


4.458 


2641 


VAL350 


CA 


-14.426 


21.29 


5.493 


2642 


VAL350 


C 


-13.865 


20.408 


6.613 


2643 


VAL350 


O 


-12.964 


20.854 


7.327 


2644 


VAL350 


CB 


-15.384 


22.334 


6.062 



-104- 



WO 2004/061116 



PCT/US2003/034082 



2645 


VAL350 


CG1 


-15.675 


23.433 


5.047 


2646 


VAL350 


CG2 


-16.682 


21.709 


6.561 


2647 


HIS351 


N 


-14.217 


19.133 


6.631 


2648 


HIS351 


CA 


-13.875 


18.287 


7 784 


2649 


HIS351 


c 


-12.75 


17.284 


7 53P 


2650 


HIS351 


o 


-12.711 


16.259 


8 PP3 


2651 


HIS351 


CB 


-15.113 


17.514 


8 P^ 


2652 


HIS351 


CG 


-16.205 


18.349 


8 87 
o.o/ 


2653 


HIS351 


ND1 


-16.137 


18 Q87 


1 VJ.UOH- 


2654 


HIS351 


CD2 


-17 46 


18 f)88 


8 ^ft 

O.OO 


2655 


HIS351 


CE1 


-1 7.304 


1Q 61Q 


m pq 


2656 


HIS351 


NF2 


-18 1PP 


1Q ^7P 


Q P4P 


2657 


GLN352 


IN 


-11 878 


17 


ft Kftft 
o.ooo 


2658 


GLN352 


CA 


-1 n 8f5p 


18 m 8 

IO.O 1 o 


ft pft 
o.^o 


2659 






-y./ *70 


1ft ^8R 
I o.ooo 


7 QR7 
/ .OO/ 




Gl NftRP 


n 


-y./ io 


17 10/* 

1 / . i yo 


ft OQ1 

o.^y i 




Gt NIHP 


wD 


nn 17ft 
- iu. i to 


1ft 7ftR 
1 0. / OO 


4 OOQ 

*f.yz£0 




gi 


HG 




17 Oft 


**.yu^i 


pfifts 


Gl M°jqo 

V3 UINOO^ 




-O.OUo 


17 ftrt*5 
1 I .OUO 


Q 71 O 


OfiftA 




Ut I 


-/.OlO 


iQ COO 

I O.OidO 


o.Ooo 


Pfiftl 
ilOOO 


OLINOOiC 




-O.0/<i 


lO./OO 


<i.yo4 


Pfiftfi 


O TOOOO 


M 


-y.U4y 


i o.^yo 


"7 o*71 


OCC7 
<£OD / 


O TOOOO 


OA 




i A OR A 

l4.yo4 






O TOOOO 




£> poo 

-o.ooo 


io.yo<i 


Q Q1 O 

o.o iy 


PfiftQ 


U TOOOO 




-O. lUl 


•Ift A*70 
lO.U/^ 


/.OOO 




v TOOOO 


PR 


"/ .04C*f 


IQ CIft7 
1 O.OO / 


7 Oft1 

/ .yo i 


Pfi71 


v TOOOO 


Ou 


-o.*m ^ 


HO QiA 

J^.olo 


y. i / 


Pfi7P 


1 PI 

LCUOO*+ 


M 


ft TO A 


1ft ft1 
IO.O I 


O >fft/f 






PA 


c RH7 
-O.OU / 


17 KAQ 


O Rftft 

y.ooo 


2674- 




V-/ 


•4 °*nft 


1ft ft^O 


in neQ 

1 u.uoy 


267^ 






-O.^liif 


17 QCQ 

I / »ooo 


Q 74.7 


2676 




GR 


"O.yo/ 


1ft Rft1 
1 O.OO 1 


in 7^8 

IV/./ MO 


2677 




OG 


-ft 

-O.OUO 


1Q 7 

iy.# 


in inn 

I U. I OO 


2678 


L FU^4 


OD1 


-7 148 
• / • 1 HO 


on 7H1 


11 pftd 


2679 


l FU^54 


ODP 


-R n*>A 


on onft 


q n^7 


2680 


GLY355 


N 


-4 414 


11 fi 


10 478 


2681 


GLY355 

Vila 1 WV#W* 


CA 




14 7Q1 


10 77P 


2682 


GLY355 


c 


-P 88*5 


18 Q^fi 




2683 


GLY355 


o 


-2 119 


12 936 


9 702 


2684 


GLN356 


N 


-3.078 


14.499 


8.393 


2685 


GLN356 


CA 


-2.896 


13.81 


7.119 


2686 


GLN356 


c 


-1 .414 


13.677 


6.799 


2687 


GLN356 


o 


-0.939 


12.562 


6.544 


2688 


GLN356 


CB 


-3.598 


14.696 


6.094 


2689 


GLN356 


CG 


-3.657 


14.117 


4.691 


2690 


GLN356 


CD 


-4.524 


15.024 


3.818 


2691 


GLN356 


OE1 


-4.609 


14.834 


2.599 


2692 


GLN356 


NE2 


-5.231 


15.939 


4.463 


2693 


ASN357 


N 


-0.67 


14.7 


7.191 


2694 


ASN357 


CA 


0.781 


14.7 


7.005 


2695 


ASN357 


C 


1.476 


13.761 


7.989 


2696 


ASN357 


0 


2.328 


12.977 


7.554 


2697 


ASN357 


CB 


1.258 


16.137 


7.203 


2698 


ASN357 


CG 


2.78 


16.244 


7.261 



-105- 



WO 2004/061116 



2699 


ASN357 


OD1 


2700 


ASN357 


ND2 


2701 


LEU358 


N 


2702 


LEU358 


CA 


2703 


LEU358 


C 


2704 


LEU358 


O 


2705 


LEU358 


CB 


2706 


LEU358 


CG 


2707 


LEU358 


CD1 


2708 


LEU358 


CD2 


2709 


ALA359 


N 


2710 


ALA359 


CA 


2711 


ALA359 


c 


2712 


ALA359 


o 


2713 


ALA359 


CB 


2714 


ARG360 


N 


2715 


ARG360 


CA 


2716 


ARG360 


c 


2717 


ARG360 


o 


2718 


ARG360 


CB 


2719 


ARG360 




2720 


ARG360 


CD 


2721 


ARG360 


MP 


2722 


ARG360 


C7 


2723 


ARG360 


NH1 


2724 


ARG360 


NH2 


2725 


LEU361 


N 


2726 


LEU361 


CA 


2727 


LEU361 


c 


2728 


LEU361 


o 


2729 


LEU361 


CB 


2730 


LEU361 


CG 


2731 


LEU361 


CD1 


2732 


LEU361 


CD2 


2733 


GLU362 


N 


2734 


GLU362 


CA 


2735 


GLU362 


C 


2736 


GLU362 


o 


2737 


GLU362 


CB 


2738 


GLU362 


CG 


2739 


GLU362 


CD 


2740 


GLU362 


OE1 


2741 


GLU362 


OE2 


2742 


LEU363 


N 


2743 


LEU363 


CA 


2744 


LEU363 


C 


2745 


LEU363 


O 


2746 


LEU363 


CB 


2747 


LEU363 


CG 


2748 


LEU363 


CD1 


2749 


LEU363 


CD2 


2750 


GLN364 


N 


2751 


GLN364 


CA 


2752 


GLN364 


C 



PCT/US2003/034082 



3.324 


16.808 


8.217 


3.444 


15.745 


6.233 


0.896 


13.592 


9.166 


1.515 


12.727 


10.168 


1.271 


11.264 


9.826 


2.234 


10.487 


9.791 


0.908 


13.034 


11.53 


1.612 


12.261 


12.639 


3.089 


12.639 


12.712 


0.931 


12.493 


13.982 


0.102 


10.986 


9 272 


-0.235 


9.616 


8 flft7 


0.571 


9.151 


7 67Q 


1.204 


8 089 


7 75fi 

/ ml 


-1 723 

1 ■ 1 C-\J 




ft Rfifi 


0.794 


10 045 


ft 79ft 


1 5R5 




5 54ft 


3 078 


q fin*} 


5 ft5** 


3 747 

O. I *T# 


ft fi77 

O.O/ / 


5 *X7A. 


1 354 


m 791 

1 \J.f C 1 


4 45Q 


081 


10 fiftl 


*3 Q54 


-0 ^95 


11 759 

II./ 


9 AQft 

c.tjyo 


-fl 144 


iq OQ7 


o.*tOo 




14. H7ft 


n.ooo 




15 9ft** 




1 045 


1Q QCQ 


1 £Q1 
1 .Do I 




10 T7Q 


ft ft*3 
D.OO 


4 QP1 


IU.OOD 




5 238 


Q 07ft 


ft nn5 


6.174 


8 36 


7 fi9ft 


5 159 


11 5fifi 


ft 1<M 
O. IOO 


6 612 


11 6Q 


ft 579 


7.537 


11 811 


7 365 


6.778 


12 888 


q 40ft 


4.306 


8.658 


8 846 


4.499 


7.431 


9 622 


4.413 


6.2 


8.73 


5.31 


5.352 


8.793 


3.418 


7.347 


10.694 


3.519 


8.493 


11.693 


2.341 


8.447 


12.662 


1.268 


8.901 


12.284 


2.517 


7.891 


13.736 


3.541 


6.253 


7.736 


3.389 


5.125 


6.814 


4.61 


4.946 


5.927 


5.174 • 


3.844 


5.889 


2.184 


5.364 


5.914 


0.977 


4.517 


6.298 


0.35 


4.995 


7.601 


-0.056 


4.54 


5.179 


5.151 


6.047 


5.432 


6.296 


5.955 


4.529 


7.559 


5.544 


5.276 



-106- 



WO 2004/061116 



PCT/US2003/034082 



2753 


GLN364 


O 


8.219 


4.588 


4.843 


2754 


GLN364 


CB 


6.505 


7.308 


3.86 


2755 


GLN364 


CG 


7.624 


7.232 


2.83 


2756 


GLN364 


CD 


7.846 


8.592 


2.181 


2757 


GLN364 


OE1 


7.741 


9.637 


2.835 


2758 


GLN364 


NE2 


8.108 


8.565 


0.886 


2759 


ILE365 


N 


7.707 


6.019 


6.503 


2760 


ILE365 


CA 


8.879 


5.644 


7.298 


2761 


ILE365 


C 


8.835 


4.171 


7.691 


2762 


ILE365 


O 


9.772 


3.438 


7.339 


2763 


ILE365 


CB 


8.94 


6.511 


8.554 


2764 


ILE365 


CG1 


9.202 


7.973 


8.208 


2765 


ILE365 


CG2 


10.012 


5.996 


9.506 


2766 


ILE365 


CD1 


10.539 


8.151 


7.497 


2767 


VAL366 


N 


7.667 


3.695 


8.099 


2768 


VAL366 


CA 


7.555 


2.301 


8.53 


2769 


VAL366 


C 


7.703 


1.335 


7.361 


2770 


VAL366 


O 


8.611 


0.497 


7.417 


2771 


VAL366 


CB 


6.21 


2.076 


9.217 


2772 


VAL366 


CG1 


6.011 


0.606 


9.565 


2773 


VAL366 


CG2 


6.085 


2.923 


10.477 


2774 


PHE367 


N 


7.094 


1.641 


6.225 


2775 


PHE367 


CA 


7.145 


0.701 


5.097 


2776 


PHE367 


C 


8.524 


0.663 


4.453 


2777 


PHE367 


O 


9.066 


-0.434 


4.254 


2778 


PHE367 


CB 


6.125 


1.099 


4.034 


2779 


PHE367 


CG 


4.662 


1.012 


4.458 


2780 


PHE367 


CD1 


3.726 


1.83 


3.841 


2781 


PHE367 


CD2 


4.258 


0.112 


5.435 


2782 


PHE367 


CE1 


2.391 


1.767 


4.216 


2783 


PHE367 


CE2 


2.924 


0.052 


5.813 


2784 


PHE367 


CZ 


1.99 


0.88 


5.206 


2785 


ASP368 


N 


9.187 


1.808 


4.423 


2786 


ASP368 


CA 


10.522 


1.869 


3.83 


2787 


ASP368 


C 


11.52 


1.102 


4.686 


2788 


ASP368 


O 


12.115 


0.138 


4.185 


2789 


ASP368 


CB 


10.964 


3.326 


3.707 


2790 


ASP368 


CG 


10.101 


4.09 


2.701 


2791 


ASP368 


OD1 


10.208 


5.31 


2.678 


2792 


ASP368 


OD2 


9.508 


3.443 


1.847 


2793 


THR369 


N 


11.438 


1.285 


5.995 


2794 


THR369 


CA 


12.381 


0.606 


6.892 


2795 


THR369 


C 


12.07 


-0.88 


7.076 


2796 


THR369 


O 


13.004 


-1.658 


7.287 


2797 


THR369 


CB 


12.376 


1.294 


8.252 


2798 


THR369 


OG1 


11.058 


1.242 


8.778 


2799 


THR369 


CG2 


12.797 


2.756 


8.149 


2800 


LEU370 


N 


10.848 


-1.302 


6.793 


2801 


LEU370 


CA 


10.523 


-2.729 


6.862 


2802 


LEU370 


C 


11.145 


-3.486 


5.703 


2803 


LEU370 


O 


12.038 


-4.319 


5.916 


2804 


LEU370 


CB 


9.012 


-2.919 


6.78 


2805 


LEU370 


CG 


8.302 


-2.463 


8.044 


2806 


LEU370 


CD1 


6.792 


-2.589 


7.882 



-107- 



WO 2004/061116 



PCTAJS2003/034082 



2807 


LEU370 


CD2 


8.79 


-3.256 


9.248 


2808 


PHE371 


N 


10.872 


-3.006 


4.502 


2809 


PHE371 


CA 


11.266 


»3.75 


3.303 


2810 


PHE371 


C 


12.728 


-3.529 


2.918 


2811 


PHE371 


O 


13.332 


-4.398 


2.278 


2812 


PHE371 


CB 


10.333 


-3.34 


2.169 


2813 


PHE371 


CG 


8.861 


-3.634 


2.463 


2814 


PHE371 


CD1 


7.935 


-2.599 


2.503 


2815 


PHE371 


CD2 


8.447 


-4.94 


2.692 


2816 


PHE371 


CE1 


6.601 


-2.867 


2.784 


2817 


PHE371 


CE2 


7.112 


-5.209 


2.971 


2818 


PHE371 


CZ 


6.19 


-4.172 


3.019 


2819 


ARG372 


N 


13.335 


-2.472 


3.436 


2820 


ARG372 


CA 


14.773 


-2.281 


3.231 


2821 


ARG372 


C 


15.606 


-2.901 


4.353 


2822 


ARG372 


O 


16.834 


-2.982 


4.229 


2823 


ARG372 


CB 


15.084 


-0.795 


3.116 


2824 


ARG372 


CG 


14.397 


-0.176 


1.904 


2825 


ARG372 


CD 


14.777 


1.291 


1.76 


2826 


ARG372 


NE 


14.497 


2.015 


3.008 


2827 


ARG372 


CZ 


14.919 


3.257 


3.251 


2828 


ARG372 


NH1 


14.646 


3.835 


4.423 


2829 


ARG372 


NH2 


15.631 


3.911 


2.331 


2830 


ARG373 


N 


14.958 


-3.347 


5.418 


2831 


ARG373 


CA 


15.671 


-4.087 


6.457 


2832 


ARG373 


C 


15.659 


-5.559 


6.09 


2833 


ARG373 


O 


16.694 


-6.238 


6.097 


2834 


ARG373 


CB 


14.937 


-3.926 


7.783 


2835 


ARG373 


CG 


15.657 


-4.637 


8.922 


2836 


ARG373 


CD 


16.912 . 


-3.879 


9.335 


2837 


ARG373 


NE 


16.548 


-2.554 


9.862 


2838 


ARG373 


CZ 


16.405 


-2.304 


11.165 


2839 


ARG373 


NH1 


16.663 


-3.261 


12.059 


2840 


ARG373 


NH2 


16.046 


-1.087 


11.576 


2841 


VAL374 


N 


14.474 


-6.032 


5.749 


2842 


VAL374 


CA 


14.314 


-7.425 


5.338 


2843 


VAL374 


C 


13.644 


-7.499 


3.971 


2844 


VAL374 


O 


12.43 


-7.307 


3.841 


2845 


VAL374 


CB 


13.473 


-8.164 


6.376 


2846 


VAL374 ' 


CG1 


13.297 


-9.625 


5.984 


2847 


VAL374 


CG2 


14.086 


-8.075 


7.77 


2848 


PR0375 


N 


14.432 


-7.873 


2.976 


2849 


PR0375 


CA 


13.929 


-8.036 


1.606 


2850 


PR0375 


C 


13.088 


-9.304 


1.369 


2851 


PR0375 


O 


12.539 


-9.472 


0.275 


2852 


PR0375 


CB 


15.165 


-8.072 


0.76 


2853 


PR0375 


CG 


16.384 


-8.221 


1.658 


2854 


PR0375 


CD 


15.865 


-8.159 


3.084 


2855 


GLY376 


N 


12.945 


-10.158 


2.371 


2856 


GLY376 


CA 


12.162 


-11.386 


2.21 


2857 


GLY376 


C 


11.071 


-11.52 


3.271 


2858 


GLY376 


O 


11.012 


-12.523 


3.992 


2859 


ILE377 


N 


10.225 


-10.508 


3.367 


2860 


ILE377 


CA 


9.092 


-10.568 


4.299 



- 108- 



WO 2004/061116 



PCTAUS2003/034082 



2861 


ILE377 


C 


7.921 


-11.295 


3.645 


2862 


ILE377 


o 


7.217 


-10.731 


2.801 


2863 


ILE377 


CB 


8.663 


-9.148 


4.656 


2864 


ILE377 


CG1 


9.836 


-8.352 


5.203 


2865 


ILE377 


CG2 


7.529 


-9.169 


5.674 


2866 


ILE377 


CD1 


9.433 


-6.919 


5.526 


2867 


ARG378 


N 


7.743 


-12.55 


4.009 


2868 


ARG378 


CA 


6.648 


-13.342 


3.451 


2869 


ARG378 


C 


5.468 


-13.417 


4.409 


2870 


ARG378 


O 


5.629 


-13.304 


5.627 


2871 


ARG378 


CB 


7.186 


-14.734 


3.163 


2872 


ARG378 


CG 


8.265 


-14.671 


2.089 


2873 


ARG378 


CD 


8.975 


-16.01 


1.929 


2874 


ARG378 


NE 


9.756 


-16.33 


3,134 


2875 


ARG378 


CZ 


9.57 


-17.431 


3.864 


2876 


ARG378 


NH1 


8.587 


-18.28 


3.556 


2877 


ARG378 


NH2 


10.338 


-17.659 


4.931 


2878 


ILE379 


N 


4.277 


-13.531 


3.854 


2879 


ILE379 


CA 


3.096 


-13.713 


4.703 


2880 


ILE379 


C 


3.14 


-15.13 


5.272 


2881 


ILE379 


0 


3.519 


-16.07 


4.563 


2882 


ILE379 


CB 


1.841 


-13.536 


3.855 


2883 


ILE379 


CG1 


2.108 


-12.589 


2.692 


2884 


ILE379 


CG2 


0.702 


-12.984 


4.709 


2885 


ILE379 


CD1 


0.882 


-12.447 


1.798 


2886 


ALA380 


N 


2.872 


-15.267 


6.56 


2887 


ALA380 


CA 


2.895 


-16.598 


7.174 


2888 


ALA380 


C 


1.533 


-17.277 


7.07 


2889 


ALA380 


O 


1.435 


-18.51 


7.097 


2890 


ALA380 


CB 


3.306 


-16.471 


8.635 


2891 


VAL381 


N 


0.498 


-16.47 


6.917 


2892 


VAL381 


CA 


-0.839 


-17.005 


6.651 


2893 


VAL381 


C 


-1.231 


-16.745 


5.2 


2894 


VAL381 


O 


-0.782 


-15.768 


4.59 


2895 


VAL381 


CB 


-1.847 


-16.359 


7.599 


2896 


VAL381 


CG1 


-1.705 


-16.897 


9.018 


2897 


VAL381 


CG2 


-1.747 


-14.839 


7.57 


2898 


PR0382 


N 


-1.999 


-17.662 


4.635 


2899 


PR0382 


CA 


-2.615 


-17.424 


3.329 


2900 


PR0382 


C 


-3.477 


-16.166 


3.352 


2901 


PR0382 


O 


-4.045 


-15.802 


4.391 


2902 


PR0382 


CB 


-3.422 


-18.651 


3.039 


2903 


PR0382 


CG 


-3.29 


-19.627 


4.198 


2904 


PR0382 


CD 


-2.414 


-18.938 


5.231 


2905 


VAL383 


N 


-3.721 


-15.621 


2.172 


2906 


VAL383 


CA 


-4.415 


-14.327 


2.051 


2907 


VAL383 


C 


-5.892 


-14.388 


2.452 


2908 


VAL383 


O 


-6.376 


-13.473 


3.126 


2909 


VAL383 


CB 


- -4.302 


-13.886 


0.593 


2910 


VAL383 


CG1 


-5.05 


-12.578 


0.343 


2911 


VAL383 


CG2 


-2.838 


-13.751 


0.177 


2912 


ASP384 


N 


-6.478 


-15.572 


2.355 


2913 


ASP384 


CA 


-7.876 


-15.767 


2.759 


2914 


ASP384 


C 


-8.031 


-15.962 


4.271 



-109- 



WO 2004/061116 



2915 


ASP384 


0 


2916 


ASP384 


CB 


2917 


ASP384 


CG 


2918 


ASP384 


OD1 


2919 


ASP384 


OD2 


2920 


GLU385 


N 


2921 


GLU385 


CA 


2922 


GLU385 


C 


2923 


GLU385 


O 


2924 


GLU385 


CB 


2925 


GLU385 


CG 


2926 


GLU385 


CD 


2927 


GLU385 


OE1 


2928 


GLU385 


OE2 


2929 


LEU386 


N 


2930 


LEU386 


CA 


2931 


LEU386 


C 


2932 


LEU386 


0 


2933 


LEU386 


CB 


2934 


LEU386 


CG 


2935 


LEU386 


CD1 


2936 


LEU386 


CD2 


2937 


PR0387 


N 


2938 


PR0387 


CA 


2939 


PR0387 


C 


2940 


PR0387 


O 


2941 


PR0387 


CB 


2942 


PR0387 


CG 


2943 


PR0387 


CD 


2944 


PHE388 


N 


2945 


PHE388 


CA 


2946 


PHE388 


C 


2947 


PHE388 


0 


2948 


PHE388 


CB 


2949 


PHE388 


CG 


2950 


PHE388 


CD1 


2951 


PHE388 


CD2 


2952 


PHE388 


CE1 


2953 


PHE388 


CE2 


2954 


PHE388 


CZ 


2955 


LYS389 


N 


2956 


LYS389 


CA 


2957 


LYS389 


C 


2958 


LYS389 


0 


2959 


LYS389 


CB 


2960 


LYS389 


CG 


2961 


LYS389 


CD 


2962 


LYS389 


CE 


2963 


LYS389 


NZ 


2964 


HIS390 


N 


2965 


HIS390 


CA 


2966 


HIS390 


C 


2967 


HIS390 


O 


2968 


HIS390 


CB 



PCT/US2003/034082 



-9.156 


-16.094 


4.761 


-8.42 


-17.003 


2.048 


-8.293 


-16.849 


0.534 


-9.1 


-16.128 


-0.032 


-7.312 


-17.346 


-0.002 


-6.926 


-15.995 


5 


-6.994 


-16.177 


6.448 


-6.674 


-14.884 


7.194 


-6.638 


-14.896 


8.429 


-6.01 


-17.259 


6.874 


-6.219 


-18.561 


6.111 


-7.651 


-19.079 


6.248 


-8.017 


-19.462 


7.349 


-8.266 


-19.256 


5.205 


-6.406 


-13.81 


6.463 


-6.082 


-12.519 


7.093 


-7.266 


-11.953 


7 874 


-8.342 


-11.71 


7.315 


-5.676 


-11.542 


5.996 


-4.348 


-11.943 


5.365 


-4.081 


-11.153 


4 OQ1 


-3.204 


-11.773 


6 357 


-7.063 


-11.798 


9.173 


-8.132 


-11.39 


10 091 


-8.419 


-9.89 


10.047 


-7.84 


-9.095 


10 fin*; 


-7.647 


-11.801 


11.445 


-6.191 


-12.224 


1 1 .339 


-5.817 


-12.105 


9 873 


-9.314 


-9.528 


9.143 


-9.775 


-8.145 


9.012 


-10.688 


-7.79 


10.176 


-11.522 


-8.597 


10.603 


-10.558 


-7.999 


7.709 


-9.785 


-8.343 


6.437 


-8.78 


-7.498 


5.987 


-10.097 


-9.492 


5.721 


-8.076 


-7.809 


4.831 


-9.393 


-9.804 


4.565 


-8.381 


-8.963 


4.121 


-10.5 


-6.599 


10.707 


-11.364 


-6.141 


11.792 


-12.626 


-5.52 


11.203 


-12.542 


-4.581 


10.4 


-10.611 


-5.115 


12.633 


-11.439 


-4.708 


13.847 


-10.677 


-3.763 


14.767 


-11.487 


-3.466 


16.023 


-10.719 


-2.637 


16.96 


-13.775 


-6.068 


11.571 


-15.055 


-5.523 


11.102 


-15.385 


^.226 


11.836 


-15.845 


-4.213 


12.983 


-16.162 


-6.548 


11.316 



-110- 



WO 2004/061116 



PCT/US2003/034082 



2969 


HIS390 


CG 


-17.525 


-6.094 


10.826 


2970 


HIS390 


ND1 


-17.895 


-5.893 


9.545 


2971 


HIS390 


CD2 


-18.62 


-5.81 


11.607 


2972 


HIS390 


CE1 


-19.181 


-5.487 


9.511 


2973 


HIS390 


NE2 


-19.629 


-5.437 


10.786 


2974 


ASP391 


N 


-15.053 


-3.138 


11.167 


2975 


ASP391 


CA 


-15.269 


-1.789 


11.683 


2976 


ASP391 


C 


-15.392 


-0.871 


10.48 


2977 


ASP391 


0 


-14.395 


-0.539 


9.835 


2978 


ASP391 


CB 


-14.068 


-1.414 


12.553 


2979 


ASP391 


CG 


-14.172 


-0.02 


13.18 


2980 


ASP391 


OD1 


-14.984 


0.771 


12.707 


2981 


ASP391 


OD2 


-13.241 


0.324 


13.884 


2982 


SER392 


N 


-16.582 


-0.335 


10.283 


2983 


SER392 


CA 


-16.835 


0.448 


9.075 


2984 


SER392 


c 


-16.261 


1.863 


9 129 


2985 


SER392 


0 


-15.99 


2.431 


8 07 


2986 


SER392 


CB 


-18.342 


0.537 


8.868 


2987 


SER392 


OG 


-18.876 


1.383 


9 878 


2988 


THR393 


N 


-15.944 


2.392 


10 PQ7 


2989 


THR393 


CA 


-15.451 


3.768 


10 31^ 


2990 


THR393 


c 


-1 3.931 


3 822 


10 4fift 


2991 


THR393 


o 


-13.294 


4.722 


Q QOQ 


2992 


THR393 


CB 


-16.183 


4.53 


11 411 


2993 


THR393 


OG1 


-17.557 


4.561 


11 04 


2994 


THR393 


CG2 


-15.703 


5.972 


11 *lpft 


2995 


ILE394 


N 


-13.365 


2.816 


11 11ft 
1 I.I 1 O 


2996 


ILE394 


CA 


-11.899 


2.665 


11 POS 


2997 


ILE394 


c 


-11.525 


1.204 


10 QP 


2998 


ILE394 


o 


-11.205 


0.419 


1 1 .824 


2999 


ILE394 


CB 


-11.37 


3.095 


12.577 


3000 


1LE394 


CG1 


-11.744 


4.533 


12.921 


3001 


ILE394 


CG2 


-9.847 


2.978 


12.624 


3002 


ILE394 


CD1 


-10.977 


5.529 


12.055 


3003 


TYR395 


N 


-11.59 


0.854 


9.649 


3004 


TYR395 


CA 


-11.29 


-0.503 


9.179 


3005 


TYR395 


C 


-9.792 


-0.786 


9.302 


3006 


TYR395 


O 


-8.997 


0.149 


9.447 


3007 


TYR395 


CB 


-11.747 


-0.572 


7.721 


3008 


TYR395 


CG 


-11.784 


-1.967 


7.101 


3009 


TYR395 


CD1 


-10.958 


-2.272 


6.026 


3010 


TYR395 


CD2 


-12.648 


-2.927 


7.612 


3011 


TYR395 


CE1 


-10.991 


-3.543 


5.465 


3012 


TYR395 


CE2 


-12.682 


-4.199 


7.052 


3013 


TYR395 


CZ 


-11.852 


-4.502 


5.982 


3014 


TYR395 


OH 


-11.882 


-5.763 


5.427 


3015 


GLY396 


N 


-9.433 


-2.053 


9.421 


3016 


GLY396 


CA 


-8.007 


-2.401 


9.468 


3017 


GLY396 


C 


-7.732 


-3.871 


9.759 


3018 


GLY396 


O 


-8.601 


-4.74 


9.609 


3019 


LEU397 


N 


-6.493 


-4.132 


10.134 


3020 


LEU397 


CA 


-6.031 


-5.497 


10.406 


3021 


LEU397 


C 


-5.334 


-5.577 


11.751 


3022 


LEU397 


O 


-4.297 


-4.938 


11.961 



-111- 



WO 2004/061116 



PCT/US2003/034082 



3023 


LEU397 


CB 


-5.051 


-5.894 


9.311 


3024 


LEU397 


CG 


-5.773 


-6.502 ' 


8.12 


3025 


LEU397 


CD1 


-5.037 


-6.225 


6.822 


3026 


LEU397 


CD2 


-5.979 


-7.996 


8.325 


3027 


HIS398 


N 


-5.87 


-6.402 


12.634 


3028 


HIS398 


CA 


-5.274 


-6.514 


13.967 


3029 


HIS398 


C 


-4.348 


-7.718 


14.107 


3030 


HIS398 


O 


-3.651 


-7.846 


15.12 


3031 


HIS398 


CB 


-6.363 


-6.528 


15.033 


,3032 


HIS398 


CG 


-6.737 


-5.14 


15.525 


3033 


HIS398 


ND1 


-7.052 


-4.804 


16.79 


3034 


HIS398 


CD2 


-6.795 


-3.984 


14.781 


3035 


HIS398 


CE1 


-7.311 


r3.482 


16.851 


3036 


HIS398 


NE2 


-7.152 


-2.975 


15.607 


3037 


ALA399 


N 


-4.306 


-8.567 


13.094 


3038 


ALA399 


CA 


-3.343 


-9.671 


13.12 


3039 


ALA399 


C 


-2.7 


-9.903 


11.756 


3040 


ALA399 


O 


-3.373 


-10.014 


10.724 


3041 


ALA399 


CB 


-4.014 


-10.936 


13.633 


3042 


LEU400 


N 


-1.383 


-10.004 


11.794 


3043 


LEU400 


CA 


-0.567 


-10.204 


10.589 


3044 


LEU400 


C 


0.772 


-10.856 


10.935 


3045 


LEU400 


O 


1.712 


-10.165 


11.35 


3046 


LEU400 


CB 


-0.307 


-8.845 


9.946 


3047 


LEU400 


CG 


0.615 


-8.952 


8.736 


3048 


LEU400 


CD1 


0.005 


-9.826 


7.644 


3049 


LEU400 


CD2 


0.973 


-7.572 


8.197 


3050 


PRO401 


N 


0.815 


-12.178 


10.881 


3051 


PRO401 


CA 


2.084 


-12.9 


10.974 


3052 


PRO401 


C 


2.88 


-12.822 


9.671 


3053 


PRO401 


O 


2.413 


-13.242 


8.602 


3054 


PRO401 


CB 


1.686 


-14.312 


11.269 


3055 


PRO401 


CG 


0.197 


-14.466 


10.993 


3056 


PRO401 


CD 


-0.308 


-13.08 


10.625 


3057 


VAL402 


N 


4.074 


-12.267 


9.78 


3058 


VAL402 


CA 


5.008 


-12.183 


8.653 


3059 


VAL402 


C 


6.358 


-12.814 


8.998 


3060 


VAL402 


O 


7.008 


-12.485 


9.998 


3061 


VAL402 


CB 


5.194 


-10.723 


8.25 


3062 


VAL402 


CG1 


3.968 


-10.185 


7.523 


3063 


VAL402 


CG2 


5.553 


-9.84 


9.44 


3064 


THR403 


N 


6.772 


-13.729 


8.146 


3065 


THR403 


CA 


8.039 


-14.428 


8.342 


3066 


THR403 


C 


9.135 


-13.709 


7.571 


3067 


THR403 


O 


9.102 


-13.66 


6.335 


3068 


THR403 


CB 


7.888 


-15.853 


7.827 


3069 


THR403 


OG1 


6.715 


-16.403 


8.406 


3070 


THR403 


CG2 


9.077 


-16.723 


8.22 


3071 


TRP404 


N 


10.089 


-13.156 


8.298 


3072 


TRP404 


CA 


11.177 


-12.406 


7.66 


3073 


TRP404 


C 


12.136 


-13.344 


6.931 


3074 


TRP404 


O 


12.984 


-12.835 


6.21 


3075 


TRP404 


CB 


11.969 


-11.654 


8.719 


3076 


TRP404 


CG 


11.163 


-10.949 


9.79 



-112- 



WO 2004/061116 



3077 


TRP404 


CD1 


3078 


TRP404 


CD2 


3079 


TRP404 


NE1 


3080 


TRP404 


CE2 


3081 


TRP404 


CE3 


3082 


TRP404 


CZ2 


3083 


TRP404 


CZ3 


3084 


TRP404 


CH2 


3085 


TRP404 


OXT 


3086 


HEM1 


FE 


3087 


HEM1 


NA 


3088 


HEM1 


CIA 


3089 


HEM1 


C2A 


3090 


HEM1 


C3A 


3091 


HEM1 


C4A 


3092 


HEM1 


CHB 


3093 


HEM1 


C1B 


3094 


HEM1 


NB 


3095 


HEM1 


C4B 


3096 


HEM1 


C3B 


3097 


HEM1 


C2B 


3098 


HEM1 


CMB 


3099 


HEM1 


CAB 


3100 


HEM1 


CBB 


3101 


HEM1 


CHC 


3102 


HEM1 


C1C 


3103 


HEM1 


NC 


3104 


HEM1 


C4C 


3105 


HEM1 


C3C 


3106 


HEM1 


C2C 


3107 


HEM1 


CMC 


3108 


HEM1 


CAC 


3109 


HEM1 


CBC 


3110 


HEM1 


CHD 


3111 


HEM1 


C1D 


3112 


HEM1 


ND 


3113 


HEM1 


C4D 


3114 


HEM1 


C3D 


3115 


HEM1 


C2D 


3116 


HEM1 


CMD 


3117 


HEM1 


CAD 


3118 


HEM1 


CBD 


3119 


HEM1 


CGD 


3120 


HEM1 


01D 


3121 


HEM1 


02D 


3122 


HEM1 


CHA 


3123 


HEM1 


CMA 


3124 


HEM1 


CAA 


3125 


HEM1 


CBA 


3126 


HEM1 


CGA 


3127 


HEM1 


01A 


3128 


HEM1 


Q2A 
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10.886 


-11.444 


11.043 


10.559 


-9.637 


9.729 


10.155 


-10.524 


11.721 


9.943 


-9.428 


10.972 


10.506 


-8.656 


8.749 


9.278 


-8.237 


11.225 


9.638 


-7.468 


9.009 


9.226 


-7.257 


10.239 


12.117 


-14.53 


7.239 


-8.08 


12.05 


10.226 


-9.653 


12.085 


9.078 


-10.7 


13.004 


9.077 


-11.687 


12.681 


8.118 


-11.292 


11.525 


7.568 


-10.019 


11.174 


8 1PQ 


-9.224 


10.115 




-7.931 


983 


ft 1ft1 


-7.308 


10 582 


Q 1ftP 


-6.086 


9.964 


Q **RA 

C7.00H- 


-5 946 


8 


ft fJOfi 

O.OUD 


-7 068 


P 771 


7 7AR 


-7 416 


7 7tt 


ft fiftp 


-4 833 


ft 031 


ft RQ1 


-4 44 

ti r t 


7 0^1 


7 7 A 


-5 212 


10 PQft 


1H 17 A 


-5.439 


11 PP3 


11 33ft 


-6.519 


IP 039 


11 IRA 

1 1 .00*r 


-6.227 


12 887 


1P AOR 


-4.926 


12 63ft 


13 OOP 


-4.491 


1 1 .556 


12 313 


-3.265 


10.712 


12 53P 


-4.462 


13.435 


14 055 


-3.452 


13.231 


14.936 


-7.061 


13 855 


12.91 


-8.237 


14.203 


12.292 


-8.777 


13.572 


11.18 


-9.915 


14.313 


10.916 


-10.045 


15.413 


11.808 


-9.006 


15.334 


12.673 


-8.71 


16.241 


13.844 


-11.178 


16.421 


11.802 


-10.91 


17.624 


10.918 


-12.079 


18.574 


10.862 


-13.198 


18.167 


11.204 


-11.889 


19.736 


10.477 


-10.849 


14.026 


9.961 


-12.005 


10.703 


6.498 


-12.907 


13.51 


7.748 


-14.087 


13.112 


8.645 


-15.442 


13.596 


8.14 


-15.522 


14.131 


7.009 


-16.439 


13.4 


8.866 
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What is Claimed is : 

1. An isolated nucleic acid sequence encoding epothilone B hydroxylase 
or a mutant or variant thereof. 

5 

2. The isolated nucleic acid sequence of claim 1 comprising SEQ ID NO: 
1, 30, 32, 34, 36, 37, 38, 39, 40, 41, 42, 60, 62, 64, 66, 68, 72 or 74. 

3. The isolated nucleic acid sequence of claim 1 comprising SEQ ID 

10 NO:l. 

4. The isolated nucleic acid sequence of claim 1 encoding a mutant with 
at least one amino acid substitution in an active site of the epothilone B hydroxylase 
enzyme. 

15 

5. The isolated nucleic acid sequence of claim 1 encoding a mutant with 
at least one amino acid substitution at amino acid GLU31, ARG67, ARG88, ELE92, 
ALA93, VAL106, HJB130, ALA140, MET176, PHE190, GLU 231, SER294, 
PHE237, or ILE365 of SEQ ID NO:2. 

20 

6. The isolated nucleic acid sequence of claim 1 encoding a mutant with 
at least one amino acid substitution at amino acid LEU39, GLN43, ALA45, MET57, 
LEU58, HB62, PHE63, SER64, SER65, ASP66, ARG67, GLN68, SER69, LEU74, 
MET75, VAL76, ALA77, ARG78, GLN79, ILE80, ASP84, LYS85, PR086, PHE87, 

25 ARG88, PR089, SER90, LBU91, ILE92, ALA93, MET94, ASP95, HIS99, ARG103, 
PHE1 10, ILE155, PHE169, GLN170, CYS172, SER173, SER174, ARG175, 
MET176, LEU177, SER178, ARG179, ARG186, PHE190, LEU193, VAL233, 
GLY234, LEU235, ALA236, PHE237, LEU238, LEU239, LEU240, ELE241, 
ALA242, GLY243, HIS244, GLU245, THR246, THR247, ALA248, ASN249, 

30 MET250, LEU283, THR287, HE288, ALA289, GLU290, THR291, ALA292, 
THR293, SER294, ARG295, PHE296, ALA297, THR298, GLU312, GLY313, 
VAL314, VAL315, GLY316, VAL344, ALA345, PHE346, GLY347, PHE348, 
VAL350, HIS351, GLN352, CYS353, LEU354, GLY355, GLN356, LEU358, 
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ALA359, GLU362, LYS389, ASP391, SER392, THR393, DLE394, or TYR395 of 
SEQIDNO:2. 

7. The isolated nucleic acid sequence of claim 1 encoding a variant 
5 comprising SEQ ID NO:43, 44, 45, 46, 47, 48 or 49. 

8. A polypeptide encoded by the isolated nucleic acid sequence of claim 

1 

10 9. An isolated nucleic acid molecule that is capable of hybridizing to a 

nucleic acid sequence of claim 2, or to the complementary sequence of said nucleic 
acid sequence, under hybridization conditions of 3X SSC at 65°C for 16 hours, said 
isolated nucleic acid molecule being capable of remaining hybridized to said nucleic 
acid sequence, or to the complementary sequence of said nucleic acid sequence, under 

15 wash conditions of 0.5X SSC, 55°C for 30 minutes. 

10. An isolated polypeptide comprising SEQ ID NO:2. 

20 1 1 . An isolated mutant polypeptide of epothilone B hydroxylase of SEQ 

ID NO:2 comprising an amino acid sequence with at least one amino acid substitution 
in an active site of epothilone B hydroxylase enzyme of SEQ ID NO:2. 

12. An isolated mutant polypeptide of epothilone B hydroxylase of SEQ 
25 ID NO:2 comprising an amino acid sequence with at least one amino acid substitution 
at amino acid GLU31, ARG67, ARG88, HJB92, ALA93, VAL106, ILE130, 
ALAMO, MET176, PHE190, GLU 231, SER294, PHE237, or BLE365 of SEQ ID 
NO:2. 

30 13. An isolated mutant polypeptide of epothilone B hydroxylase of SEQ 

ID NO:2 comprising an amino acid sequence with at least one amino acid substitution 
at amino acid LEU39, GLN43, ALA45, MET57, LEU58, HES62, PHE63, SER64, 
SER65, ASP66, ARG67, GLN68, SER69, LEU74, MET75, VAL76, ALA77, 
ARG78, GLN79, HJB80, ASP84, LYS85, PR086, PHE87, ARG88, PR089, SER90, 

35 LEU91, DLE92, ALA93, MET94, ASP95, HK99, ARG103, PHE1 10, DLE155, 
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PHE169, GLN170, CYS172, SER173, SER174, ARG175, MET176, LEU177, 
SER178, ARG179, ARG186, PHE190, LEU193, VAL233, GLY234, LEU235, 
ALA236, PHE237, LEU238, LEU239, LEU240, DLE241, ALA242, GLY243, 
HIS244, GLU245, THR246, THR247, ALA248, ASN249, MET250, LEU283, 
THR287, HJB288, ALA289, GLU290, THR291, ALA292, THR293, SER294, 
ARG295, PHE296, ALA297, THR298, GLU312, GLY313, VAL314, VAL315, 
GLY316, VAL344, ALA345, PHE346, GLY347, PHE348, VAL350, HIS351, 
GLN352, CYS353, LEU354, GLY355, GLN356, LEU358, ALA359, GLU362, 
LYS389, ASP391, SER392, THR393, BLE394, or TYR395 of SEQ ID NO:2. 

14. An isolated mutant polypeptide of epothilone B hydroxjdase 
comprising SEQ ID NO: 31, 33, 35, 61, 63, 65, 67, 69, 71, 73 or 75. 

15. An isolated variant polypeptide of epothilone B hydroxylase 
15 comprising SEQ ID NO: 43, 44, 45, 46, 47, 48 or 49. 

16. An isolated nucleic acid sequence encoding a ferredoxin. 

17. The isolated nucleic acid sequence of claim 16 comprising SEQ ID 

20 NO:3. 

18. A polypeptide encoded by the isolated nucleic acid sequence of claim 
16. - 

25 19. An isolated nucleic acid molecule that is capable of hybridizing to the 

nucleic acid sequence set forth in SEQ ID NO:3, or to the complementary sequence of 
the nucleic acid sequence set forth in SEQ ID NO:3, under hybridization conditions of 
3X SSC at 65°C for 16 hours, said isolated nucleic acid molecule being capable of 
remaining hybridized to the nucleic acid sequence set forth in SEQ ID NO:3, or to the 

30 complementary sequence of the nucleic acid sequence set forth in SEQ ID NO:3, 
under wash conditions of 0.5X SSC, 55°C for 30 minutes. 

20. A vector comprising the isolated nucleic acid sequence of claim 1. 

35 21. The vector of claim 20 farther comprising an isolated nucleic acid 

sequence encoding a ferredoxin. 
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22. A host cell comprising the vector of claim 20. 

23. A host cell comprising the vector of claim 21. 

24. A method for producing recombinant microorganisms which 
hydroxylate epothilones having a terminal alkyl group to produce epothilones having 
a terminal hydroxyalkyl group, said method comprising transfecting a microorganism 
with the vector of claim 20 or 21. 

25. A recorubinantly produced microorganism that hydroxylates 
epothilones having a terminal alkyl group to produce epothilones having a terminal 
hydroxyalkyl group. 



15 26. The recombinantly produced microorganism of claim 25 wherein said 

microorganism expresses a nucleic acid sequence of SEQ ID NO: 1, 30, 32, 34, 36, 
37, 38, 39, 40, 41, 42, 60, 62, 64, 66, 68, 72 or 74. 

27. A method for the preparation of at least one epothilone of the 
20 following formula I 

HO-CHz-CAOn-CQJ^CAzVE (I) 

where 

Ai and A 2 are independently selected from the group of optionally substituted C1-C3 
alkyl and alkenyl; 

25 Q is an optionally substituted ring system containing one to three rings and at least 
one carbon to carbon double bond in at least one ring; 

n, m, and o are integers selected from the group consisting of zero and 1, where at 
least one of m or n or o is 1; and 
E is an epothilone core; 
30 comprising the steps of contacting at least one epothilone of the following formula II 

CHa-CAOn-C^^o-E (n) 
where A x , Q, A 2 , E, n, m, and o are defined as above; 
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with a recombinantly produced microorganism, or an enzyme derived therefrom, 
which is capable of selectively catalyzing the hydroxylation of Formula H, and 
effecting said hydroxylation. 

28. A method for the preparation of an epothilone analog of Formula A 



said method comprising biotransforming epothilone B to the epothilone analog of 
Formula A by incubation with a mutant epothilone B hydroxylase enzyme compris 
10 SEQIDNO:31. 



15 or a phannaceutically acceptable salt thereof. 

30. A homology model of epothilone B hydroxylase having a root mean 
square deviation of conserved residue backbone atoms of less than about 4.0 A when 
superimposed on a corresponding backbone atoms described by structure coordinates 
20 listed in Appendix 1. 




OH 



29. A compound of Formula A 




OH 
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31. A method for producing a mutant with altered biological properties, 
function, yield of a desired product, rate of reaction, substrate specificity, or activity 
as compared to epothilone B hydroxylase, said method comprising the steps of: 
identifying an amino acid of SEQ ID NO:2 to mutate; and mutating the identified 

5 amino acid to create a mutant protein. 

32. The method of claim 3 1 wherein a homology model of epothilone B 
hydroxylase having a root mean square deviation of conserved residue backbone 
atoms of less than about 4.0 A when superimposed on a corresponding backbone 

10 atoms described by structure coordinates listed in Appendix 1 is used to identify an 
amino acid of SEQ ID NO: 2 to mutate. 

33. The method of claim 31 wherein the identified amino acid is LETJ39, 
GLN43, ALA45, MET57, LEU58, B3S62, PHE63, SER64, SER65, ASP66, ARG67, 

15 GLN68, SER69, LEU74, MET75, VAL76, ALA77, ARG78, GLN79, ILE80, ASP84, 
LYS85, PRG86, PHE87, ARG88, PR089, SER90, LEU91, ILE92, ALA93, MET94, 
ASP95, HIS99, ARG103, PHE110, ILE155, PHE169, GLN170, CYS172, SER173, 
SER174, ARG175, MET176, LEU177, SER178, ARG179, ARG186, PHE190, 
LEU193, VAL233, GLY234, LEU235, ALA236, PHE237, LEU238, LEU239, 

20 LEU240, 1LE241, ALA242, GLY243, HIS244, GLU245, THR246, THR247, 
ALA248, ASN249, MET250, LEU283, THR287, ILE288, ALA289, GLU290, 
THR291, ALA292, THR293, SER294, ARG295, PHE296, ALA297, THR298, 
GLU312, GLY313, VAL314, VAL315, GLY316, VAL344, ALA345, PHE346, 
GLY347, PHE348, VAL350, HLS351, GLN352, CYS353, LEU354, GLY355, 

25 GLN356, LEU358, ALA359, GLU362, LYS389, ASP391, SER392, THR393, 
ILE394, or TYR395 of SEQ ID NO:2. 

34. The method of claim 31 wherein the identified amino acid is GLU31, 
ARG67, ARG88, HJ392, ALA93, VAL106, ILE130, ALA140, MET176, PHE190, 
30 GLU 231, SER294, PHE237, or 1LE365 of SEQ ID NO:2. 
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35. The method of claim 3 1 wherein the mutant protein improves yield of 
a desired product as compared to the yield of a desired product obtained using 
epothiloneB hydroxylase. 

5 36. The method of claim 35 wherein the desired product is epothilone F. 

37. The method of claim 3 1 wherein the mutant improves the rate of 
reaction as compared to the rate of reaction using epothilone B hydroxylase. 

10 38. The method of claim 3 1 wherein the mutant exhibits altered substrate 

specificity as compared to substrate specificity of epothilone B hydroxylase. 

39. The method of claim 38 wherein amino acid SBR294 is mutated. 

15 40. The method of claim 3 1 wherein the mutant exhibits essentially 

similar biological activity or function to epothilone B hydroxylase. 

41. A machine-readable data storage medium comprising a data storage 
material encoded with structure coordinates set forth in Appendix 1. 

20 



-120- 



WO 2004/061116 



1/5 



PCT/US2003/034082 




WO 2004/061116 



PCT/US2003/034082 



Alignment used to design primers P450-l + and P450-la + 



Alignment used to design primers P450-2 + and P450-2" 



Alignment used to design primer P450-3" 

STMSUACB ttcggcttcggcgtgcaccagtgcctgggc (SEQ ID NO: 17) 

STMSUBCB ttcggcttcggcgtccaccagtgcctggga (SEQ ID NO: 18) 

3702259 ttcggctggggcccccaccactgcctgggc (SEQ ID NO: 19) 

SSU65940 ttcggtcacggcgtccacaagtgtcctggc (SEQ ID NO:20) 

STMOLEP ttcgggcacggagcgcaccactgcatcggc (SEQ ID NO: 21) 

SERCP450A ttcggccacggcatccacttctgcgtgggc (SEQ ID NO: 22) 



STMSUACB 

STMSUBCB 

3702259 

SSU65940 

STMOLEP 

SERCP450A 



tcctcatcgccggccacgagac (SEQ ID NO: 5) 

tgctggtcgccggccacgagac (SEQ ID NO: 6) 

tgctcatcaccggccaggacac (SEQ ID NO: 7) 

--ctgttcgccgggcacgactc (SEQ ID NO: 8) 

tgctcatcgcgggccacgagac (SEQ ID NO: 9) 

tgctggtcgccgggcacgagac (SEQ ID NO: 10) 



STMSUACB 

STMSUBCB 

3702259 

SSU65940 

STMOLEP 

SERCP450A 



cggcgcggtggaggaactgct (SEQ ID NO: 11) 

gggcgccgtcgaggagctgct (SEQ ID NO: 12) 

ccgcaccctggaggagctgct (SEQ ID NO: 13) 

cggcgcggtcgaggagatgct (SEQ ID NO: 14) 

cgcggcggtggaggagatgct (SEQ ID NO: 15) 

cggcgcgatcgaggagaccct (SEQ ID NO: 16) 
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EPO-B MTDVEBTTATLPLARKCPFSPP - - PEYERLRRES PVSRVGLPSGQTAWALTRLEDIREML 

UIKA ATVPDLESDSFHVDWYSTYAELRETAPVTPVRFL -GQBAWLVTGYDEAKAAL 

**:* .* . * .**. * : ** ** ;*:;:* 

EPO-B SSPHFSSD- -RQSPSFPLMVARQI- -RREDKP-FRPSLIAMDPPEHGKAREDVVGEFTVK 

UINA SDLRIiSSDPKKKYPGVEVEFPAYLGFPEDVRNYFATNMGTSDPPT^ 

*. ::*** :: *.. : : .::*..:: *** * : *: * ****. 

EPO-B RMKALQPRIQQIVDEHIDALLAGPKPADLVQALSL^ 

1JINA RVEAMRPRVEQI TAELLDEV- GDSGWD I VDRFAHPLP I KVICELLGVDEAARGAFGRWS 

*::*::**::**. * :* : ... .*:*: :: * : * ******** : : * * 

EPO-B SRMLSREVT- AEERMTAFESLENYLDELVTKKEANATEDDItliGRQ IIiK^ 

UINA SBILVMDPERAEQRGQAAREWNF I LD LVERRRTE PGDDIjLSAL I SVQDDDDGRLSADEL 

*.:* : **:* * ..: * :: : ** : * * . ::: : .*. . .** 

EPO-B VGLAFLLLIAGHETTANMISLGTOT^ 

UINA TS lALVLLLAGFEASVSL IGIGTYLIiLTHPDQLALVRADPSALPNAVE E I LRYIAPPETT 

..:*::**:**.*::..:*.:** ** ****** ::***. * : ** ; ** :; .**. 

EPO-B TSRFATADVE IGGTL IRAGEGWGLSNAGNHDPDGPENPDTFDIERGARHHVAFGFGVHQ 

UINA T-RFAAEEVEIGGVAIPQYSTVLVANGAANRDPSQFPDPHRFDVTRDTRGHLSFGQGIHF 

* ***: :****★. * . *; ..*,*;**. * ;*/**; *.;* *;;** *;* 

EPO-B CLGQNLARLELQIVFOTLFRRVPGIRIAVPVDELPPKHDSTIYGLH^ - 

UINA CMGRPLAKLEGEVALRAL FGRF PALS LGI D ADD WWRRS LLLRG IDHLPVRLDG 

*.*. **.** .... .** * # * m . s- . ... : * : . *** 
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Residue Number 
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SEQUENCE LISTING 
<110> Bristol-Myers Squibb Company 

<120> COMPOSITIONS AND METHODS FOR HYDROXYLATING EPOTHILONES 

<130> D0231 PCT2 

<150> US 10/321,188 

<151> 2002-12-17 

<160> 76 

<170> Patentln version 3.1 

<210> 1 
<211> 1186 
<212> DNA 

<213> Amycolatopsis orientalis 
<400> 1 

atgaccgacg tcgaggaaac caccgcgacc ttgccactgg cccgcaaatg cccgttttca 60 

ccaccgcccg aatacgagcg gctccgccgg gaaagtccgg tttcccgggt cggtctcccc 120 

tccggtcaaa ccgcttgggc gctcacccgg ctcgaagaca tccgcgaaat gctgagcagt 180 

ccgcatttca gctccgaccg gcagagtccg tcgttcccgc tgatggtggc gcggcagatc 240 

cggcgcgagg acaagccgtt ccgcccgtcc ctcatcgcga tggacccgcc ggaacacggc 300 

aaggccaggc gtgacgtcgt cggggaattc accgtcaagc gcatgaaagc gcttcagcca 360 

cgtattcagc agatcgtcga cgagcatatc gacgccctgc tcgccggccc caaacccgcc 420 

gatctcgtcc aggcgctttc cctgccggtt ccgtccttgg tgatctgcga actgctcggt 480 

gtcccctatt cggaccacga gttcttccag tcctgcagtt cccggatgct cagccgggaa 540 

gtcaccgccg aagaacggat gaccgcgttc gagtcgctcg agaactatct cgacgaactc 600 

gtcacgaaga aggaggcgaa cgccaccgag gacgacctcc tcggccgcca gatcctgaag 660 

cagcgcgaat ccggcgaagc cgaccacggc gaactggtcg gtctggcgtt cctcctgctc 720 

atcgcggggc acgagactac ggcgaacatg atctcgctcg gcacggtgac cctgctggag 780 

aaccccgatc agctggcgaa gatcaaggcg gatccgggca agaccctcgc cgcgatcgag 840 

gaactcctgc ggatcttcac catcgcggag acggcgacct cacgcttcgc cacggcggac 900 

gtcgagatcg gcggcacgct catccgcgcg ggtgaaggcg tcgtcggcct gagcaacgcg 960 

ggcaaccacg atccggacgg cttcgagaac ccggacacct tcgacatcga acgcggcgcg 1020 

cggcatcacg tcgccttcgg attcggtgtg caccaatgcc tcggccagaa cttggcgagg 1080 

ttggaactcc agatcgtgtt cgatacgttg ttccggcgag tgccgggcat ccggatcgcc 1140 
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gtaccggtcg acgaactgcc gttcaagcac gattcgacga tctacg 1186 



<210> 2 

<211> 404 

<212> PRT 

<213> Amycolatopsis orientalis 

<400> 2 

Met Thr Asp Val Glu Glu Thr Thr Ala Thr Leu Pro Leu Ala Arg Lys 
1 5 io is 



Cys Pro Phe Ser Pro Pro Pro Glu Tyr Glu Arg Leu Arg Arg Glu Ser 
20 25 30 



Pro Val Ser Arg Val Gly Leu Pro 
35 40 



Thr Arg Leu Glu Asp lie Arg Glu 
50 55 



Ser Asp Arg Gin Ser Pro Ser Phe 
65 70 



Arg Arg Glu Asp Lys Pro Phe Arg 
85 



Ser Gly Gin Thr Ala Trp Ala Leu 
45 

Met Leu Ser Ser Pro His Phe Ser 
60 

Pro Leu Met Val Ala Arg Gin He 
75 80 

Pro Ser Leu He Ala Met Asp Pro 
90 95 



Pro Glu His Gly Lys Ala Arg Arg Asp Val Val Gly Glu Phe Thr Val 
100 105 ' 110 



Lys Arg Met Lys Ala Leu Gin Pro Arg He Gin Gin He Val Asp Glu 
115 120 125 



His He Asp Ala Leu Leu Ala Gly Pro Lys Pro Ala Asp Leu Val Gin 
130 135 140 

Ala Leu Ser Leu Pro Val Pro Ser Leu Val He Cys Glu Leu Leu Gly 
145 150 155 leo 

Val Pro Tyr Ser Asp His Glu Phe Phe Gin Ser Cys Ser Ser Arg Met 
165 170 175 

Leu Ser Arg Glu Val Thr Ala Glu Glu Arg Met Thr Ala Phe Glu Ser 
180 185 190 



2 
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Leu Glu Asn Tyr Leu Asp Glu Leu Val Thr Lys Lys Glu Ala Aan Ala 
195 200 205 



Thr Glu Asp Asp Leu Leu Gly Arg Gin lie Leu Lys Gin Arg Glu Ser 
210 215 220 



Gly Glu Ala Asp His Gly Glu Leu Val Gly Leu Ala Phe Leu Leu Leu 
225 230 235 240 



lie Ala Gly His Glu Thr Thr Ala Asn Met lie Ser Leu Gly Thr Val 
245 250 255 



Thr Leu Leu Glu Asn Pro Asp Gin Leu Ala Lys lie Lys Ala Asp Pro 
260 265 270 



Gly Lys Thr Leu Ala Ala He Glu Glu Leu Leu Arg He Phe Thr He 
275 280 285 



Ala Glu Thr Ala Thr Ser Arg Phe Ala Thr Ala Asp Val Glu He Gly 
290 295 300 



Gly Thr Leu He Arg Ala Gly Glu Gly Val Val Gly Leu Ser Asn Ala 
305 310 315 320 



Gly Asn His Asp Pro Asp Gly Phe Glu Asn Pro Asp Thr Phe Asp He 
325 330 335 



' Glu Arg Gly Ala Arg His His Val Ala Phe Gly Phe Gly Val His Gin 
340 345 350 



Cys Leu Gly Gin Asn Leu Ala Arg Leu Glu Leu Gin He Val Phe Asp 
355 360 365 



Thr Leu Phe Arg Arg Val Pro Gly He Arg He Ala Val Pro Val Asp 
370 375 380 



Glu Leu Pro Phe Lys His Asp Ser Thr lie Tyr Gly Leu His Ala Leu 
385 390 395 400 



Pro Val Thr Trp 



3 
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<210> 3 

<211> 195 

<212> DNA 

<213> Amycolatopsis orientalis 



<400> 3 

atgaagatca tcgcggacac cgggaagtgc gtgggggcgg gccagtgcgt gctcaccgat 60 

cccgatctgt tcgaccagag cgaggacgac gggacggtcc tcctgctgaa cgccgagccc 120 

gaaggcgaag aggcggagga gaacgcgcgc accgccgtgc acatctgccc ggggcaggca 180 

ctttcgctcg cgtag 195 



<210> 4 

<211> 64 

<212> PRT 

<213> Amycolatopsis orientalis 

<400> 4 



Met Lys He He Ala Asp Thr Gly Lys Cys Val Gly Ala Gly Gin Cys 
1 5 10 15 



Val Leu Thr Asp Pro Asp Leu Phe Asp Gin Ser Glu Asp Asp Gly Thr 
20 25 * 30 



Val Leu Leu Leu Asn Ala Glu Pro Glu Gly Glu Glu Ala Glu Glu Asn 
35 40 * 45 



Ala Arg Thr Ala Val His He Cys Pro Gly Gin Ala Leu Ser Leu Ala 
50 55 60 



<210> 5 

<211> 22 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Synthetic 

<400> 5 

tcctcatcgc cggccacgag ac 22 

<210> 6 

<211> 22 

<212> DNA 

<213> Artificial sequence 
<220> 
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<223> Synthetic 



<400> 6 

tgctggtcgc cggccacgag ac 



22 



<210> 7 

<211> 22 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Synthetic 

<400> 7 

tgctcatcac cggccaggac ac 22 

<210> 8 

<211> 20 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Synthetic 



<210> 9 

<211> 22 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Synthetic 

<400> 9 

tgctcatcgc gggccacgag ac 22 

<210> 10 

<211> 22 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Synthetic 



<400> 8 

ctgttcgccg ggcacgactc 



20 



<400> 10 

tgctggtcgc cgggcacgag ac 



22 



<210> 11 
<211> 21 
<212> DNA 
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<213> 



Artificial sequence 



<220> 
<223> 



Synthetic 



<400> 11 

cggcgcggtg gaggaactgc t 



21 



<210> 12 

<211> 21 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Synthetic 

<400> 12 

gggcgccgtc gaggagctgc t 21 

<210> 13 

<211> 21 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Synthetic 

<400> 13 

ccgcaccctg gaggagctgc t 21 



<210> 14 

<211> 21 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Synthetic 



<211> 21 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Synthetic 

<400> 15 

cgcggcggtg gaggagatgc t 21 



<400> 14 

cggcgcggtc gaggagatgc t 



21 



<210> 



15 
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<210> 16 

<211> 21 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Synthetic 

<400> 16 

cggcgcgatc gaggagaccc t 



21 



<210> 17 

<211> 30 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Synthetic 

<400> 17 

ttcggcttcg gcgtgcacca gtgcctgggc 



30 



<210> 18 

<211> 30 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Synthetic 

<400> 18 

ttcggcttcg gcgtccacca gtgcctggga 



30 



<210> 19 

<211> 30 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Synthetic 

<400> 19 

ttcggctggg gcccccacca ctgcctgggc 



30 



<210> 20 

<211> 30 

<212> DNA 

<213> Artificial sequence 

<220> / 

<223> Synthetic 

<400> 20 
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ttcggtcacg gcgtccacaa gtgtcctggc 30 



<210> 21 

<211> 30 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Synthetic 

<400> 21 

ttcgggcacg gagcgcacca ctgcatcggc 30 



<210> 22 

<211> 30 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Synthetic 

<400> 22 

ttcggccacg gcatccactt ctgcgtgggc 30 



<210> 23 

<211> 25 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Synthetic 

<400> 23 

tgctgctsdt cgccggbcab gasac 25 



<210> 


24 


<211> 


25 


<212> 


DNA 


<213> 


Artificial sequence 


<220> 




<223> 


Synthetic 


<220> 




<221> 


misc feature 


<222> 


(9) (9) 


<223> 


n=a, c, g or t 


<400> 


24 



tgmtssysnt cgscgsbcay gasac 25 
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<210> 25 

<211> 24 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Synthetic 

<400> 25 

cggvgcsvts gaggarmtgc tgcg 24 



<210> 26 

<211> 24 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Synthetic 

<400> 26 

cgcagcakyt cctcsabsgc bccg 24 



<210> 27 

<211> 30 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Synthetic 

<400> 27 

gcccaggcas ahcacsywg gcdybggctt 30 

<210> 28 

<211> 27 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Synthetic 

<400> 28 

gcgagatcta cctggggaag gacaacc 27 



<210> 29 

<211> 27 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Synthetic 

<400> 29 
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gcgaagctta cggacttgga ccctacg 27 

<210> 30 
<211> 1215 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> Synthetic 
<400> 30 

atgaccgacg tcgaggaaac caccgcgacc ttgccactgg cccgcaaatg cccgttttca 60 

ccaccgcccg aatacgagcg gctccgccgg gaaagtccgg tttcccgggt cggtctcccc 120 

tccggtcaaa ccgcttgggc gctcacccgg ctcgaagaca tccgcgaaat gctgagcagt 180 

ccgcatttca gctccgaccg gcagagtccg tcgttcccgc tgatggtggc gcggcagatc 240 

cggcgcgagg acaagccgtt ccgcccgtcc ctcatcgcga tggacccgcc ggaacacggc 300 

aaggccaggc gtgacgtcgt cggggaattc accgtcaagc gcatgaaagc gcttcagcca 360 

cgtattcagc agatcgtcga cgagcatatc gacgccctgc tcgccggccc caaacccgcc 420 

l 

gatctcgtcc aggcgctttc cctgccggtt ccgtccttgg tgatctgcga actgctcggt 480 

gtcccctatt cggaccacga gttcttccag tcctgcagtt cccggatgct cagccgggaa 540 

gtcaccgccg aagaacggat gaccgcgttc gagtcgctcg agagctatct cgacgaactc 600 

gtcacgaaga aggaggcgaa cgccaccgag gacgacctcc tcggccgcca gatcctgaag 660 

cagcgcgaat ccggcgaagc cgaccacggc gaactggtcg gtctggcgtt cttgctgctc 720 

atcgcggggc acgagactac ggcgaacatg atctcgctcg gcacggtgac cctgctggag 780 

aaccccgatc agctggcgaa gatcaaggcg gatccgggca agaccctcgc cgcgatcgag 840 

gaactcctgc ggatcttcac catcgcggag acggcgaccc cacgcttcgc cacggcggac 900 

gtcgagatcg gcggcacgct catccgcgcg ggtgaaggcg tcgtcggcct gagcaacgcg 960 

ggcaaccacg atccggacgg cttcgagaac ccggacacct tcgacatcga acgcggcgcg 1020 

cggcatcacg tcgccttcgg attcggtgtg caccaatgcc tcggccagaa cttggcgagg 1080 

ttggaactcc agatcgtgtt cgatacgttg ttccggcgag tgccgggcat ccggatcgcc 1140 

gtaccggtcg acgaactgcc gttcaagcac gattcgacga tctacggcct ccacgccctg 1200 

ccggtcacct ggtag 1215 

<210> 31 
<211> 404 
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<212> PRT 

<213> Artificial sequence 
<220> 

<223> Synthetic 

<400> 31 

Met Thr Asp Val Glu Glu Thr Thr Ala Thr Leu Pro Leu Ala Arg Lys 
1 5 10 15 



Cys Pro Phe Ser Pro Pro Pro Glu Tyr Glu Arg Leu Arg Arg Glu Ser 
20 25 ~ 30 



Pro Val Ser Arg Val Gly lieu Pro Ser Gly Gin Thr Ala Trp Ala Leu 
35 40 " 45 



Thr Arg Leu Glu Asp lie Arg Glu Met Leu Ser Ser Pro His Phe Ser 
50 55 60 



Ser Asp Arg Gin Ser Pro Ser Phe Pro Leu Met Val Ala Arg Gin lie 
65 70 75 80 



Arg Arg Glu Asp Lys Pro Phe Arg Pro Ser Leu lie Ala Met Asp Pro 
85 90 95 



Pro Glu His Gly Lys Ala Arg Arg Asp Val Val Gly Glu Phe Thr Val 
100 105 * no 



Lys Arg Met Lys Ala Leu Gin Pro Arg lie Gin Gin lie Val Asp Glu 
115 120 125 



His He Asp Ala Leu Leu Ala Gly Pro Lys Pro Ala Asp Leu Val Gin 
130 .135 140 



Ala Leu Ser Leu Pro Val Pro Ser Leu Val He Cys Glu Leu Leu Gly 
145 150 155 160 



Val Pro Tyr Ser Asp His Glu Phe Phe Gin Ser Cys Ser Ser Arg Met 
165 170 175 



Leu Ser Arg Glu Val Thr Ala Glu Glu Arg Met Thr Ala Phe Glu Ser 
180 185 190 



Leu Glu Ser Tyr Leu Asp Glu Leu Val Thr Lys Lys Glu Ala Asn Ala 
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195 200 205 



Thr Glu Asp Asp Leu Leu Gly Arg Gin He Leu Lys Gin Arg Glu Ser 
210 215 220 



Gly Glu Ala Asp His Gly Glu Leu Val Gly Leu Ala Phe Leu Leu Leu 
225 230 235 240 



He Ala Gly His Glu Thr Thr Ala Asn Met He Ser Leu Gly Thr Val 
245 250 255 



Thr Leu Leu Glu Asn Pro Asp Gin Leu Ala Lys He Lys Ala Asp Pro 
260 265 270 



Gly Lys Thr Leu Ala Ala He Glu Glu Leu Leu Arg He Phe Thr He 
275 280 285 



Ala Glu Thr Ala Thr Pro Arg Phe Ala Thr Ala Asp Val Glu He Gly 
290 295 300 



Gly Thr Leu He Arg Ala Gly Glu Gly Val Val Gly Leu Ser Asn Ala 
305 310 315 320 



Gly Asn His Asp Pro Asp Gly Phe Glu Asn Pro Asp Thr Phe Asp He 
325 330 335 



Glu Arg Gly Ala Arg His His Val Ala Phe Gly Phe Gly Val His Gin 
340 345 350 



Cys Leu Gly Gin Asn Leu Ala Arg Leu Glu Leu Gin He Val Phe Asp 
355 360 365 



Thr Leu Phe Arg Arg Val Pro Gly He Arg He Ala Val Pro Val Asp 
370 375 380 



Glu Leu Pro Phe Lys His Asp Ser Thr He Tyr Gly Leu His Ala Leu 
385 390 395 400 



Pro Val Thr Trp 



<210> 32 
<211> 1215 
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<212> DNA 

<213> Artificial sequence 
<220> 

<223> Synthetic 
<400> 32 

atgaccgacg tcgaggaaac caccgcgacc ttgccactgg cccgcaaatg cccgttttca 60 

ccaccgcccg aatacgagcg gctccgccgg gaaagtccgg tttcccgggt cggtctcccc 120 

tccggtcaaa ccgcttgggc gctcacccgg ctcgaagaca tccgcgaaat gctgagcagt 180 

ccgcatttca gctccgaccg gcagagtccg tcgttcccgc tgatggtggc gcggcagatc 240 

cggcgcgagg acaagccgtt ccgcccgtcc ctcatcgcga tggacccgcc ggaacacggc 300 

aaggccaggc gtgacgtcgt cggggaattc accgtcaagc gcatgaaagc gcttcagcca 360 

cgtattcagc agatcgtcga cgagcatatc gacgccctgc tcgccggccc caaacccgcc 420 

gatctcgtcc aggcgctttc cctgccggtt ccgtccttgg tgatctgcga actgctcggt 480 

gtcccctatt cggaccacga gttcttccag tcctgcagtt cccggatgct cagccgggaa 540 

gtcaccgccg aagaacggat gaccgcgtac gagtcgctcg agaactatct cgacgaactc 600 

gtcacgaaga aggaggcgaa cgccaccgag gacgacctcc tcggccgcca gatcctgaag 660 

cagcgcgaat ccggcgaagc cgaccacggc cgcctggtcg gtctggcgtt cctcctgctc 720 

atcgcggggc acgagactac ggcgaacatg atctcgctcg gcacggtgac cctgctggag 780 

aaccccgatc agctggcgaa gatcaaggcg gatccgggca agaccctcgc cgcgatcgag 840 

gaactcctgc ggatcttcac catcgcggag acggcgacct cacgcttcgc cacggcggac 900 

gtcgagatcg gcggcacgct catccgcgcg ggtgaaggcg tcgtcggcct gagcaacgcg 960 

ggcaaccacg atccggacgg cttcgagaac ccggacacct tcgacatcga acgcggcgcg 1020 

cggcatcacg tcgccttcgg attcggtgtg caccaatgcc tcggccagaa cttggcgagg 1080 

ttggaactcc agatcgtgtt cgatacgttg ttccggcgag tgccgggcat ccggatcgcc 1140 

gtaccggtcg acgaactgcc gttcaagcac gattcgacga tctacggcct ccacgccctg 1200 

ccggtcacct ggtag 1215 

<210> 33 

<211> 404 

<212> PRT 

<213> Artificial sequence 
<220> 

<223> Synthetic 
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<400> 33 

Met Thr Asp Val Glu Glu Thr Thr Ala Thr Leu Pro Leu Ala Arg Lys 
1 5 10 15 



Cys Pro Phe Ser Pro Pro Pro Glu Tyr Glu Arg Leu Arg Arg Glu Ser 
20 25 30 



Pro Val Ser Arg Val Gly Leu Pro Ser Gly Gin Thr Ala Trp Ala Leu 
35 40 45 



Thr Arg Leu Glu Asp He Arg Glu Met Leu Ser Ser Pro His Phe Ser 
50 55 60 



Ser Asp Arg Gin Ser Pro Ser Phe Pro Leu Met Val Ala Arg Gin He 
65 70 75 80 



Arg Arg Glu Asp Lys Pro Phe Arg Pro Ser Leu He Ala Met Asp Pro 
85 90 95 



Pro Glu His Gly Lys Ala Arg Arg Asp Val Val Gly Glu Phe Thr Val 
100 105 110 



Lys Arg Met Lys Ala Leu Gin Pro Arg He Gin Gin He Val Asp Glu 
115 120 125 



His He Asp Ala Leu Leu Ala Gly Pro Lys Pro Ala Asp Leu Val Gin 
130 135 140 



Ala Leu Ser Leu Pro Val Pro Ser Leu Val He Cys Glu Leu Leu Gly 
145 150 155 160 



Val Pro Tyr Ser Asp His Glu Phe Phe Gin Ser Cys Ser Ser Arg Met 
165 170 175 



Leu Ser Arg Glu Val Thr Ala Glu Glu Arg Met Thr Ala Tyr Glu Ser 
180 185 190 



Leu Glu Asn Tyr Leu Asp Glu Leu Val Thr Lys Lys Glu Ala Asn Ala 
195 200 205 



Thr Glu Asp Asp Leu Leu Gly Arg Gin He Leu Lys Gin Arg Glu Ser 
210 215 220 
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Gly Glu Ala Asp His Gly Arg Leu Val Gly Leu Ala Phe Leu Leu Leu 
225 230 235 240 



He Ala Gly His Glu Thr Thr Ala Asn Met He Ser Leu Gly Thr Val 
245 250 255 



Thr Leu Leu Glu Asn Pro Asp Gin Leu Ala Lys He Lys Ala Asp Pro 
260 265 270 



Gly Lys Thr Leu Ala Ala He Glu Glu Leu Leu Arg He Phe Thr He 
275 280 285 



Ala Glu Thr Ala Thr Ser Arg Phe Ala Thr Ala Asp Val Glu He Gly 
290 295 300 



Gly Thr Leu He Arg Ala Gly Glu Gly Val Val Gly Leu Ser Asn Ala 
305 310 315 320 



Gly Asn His Asp Pro Asp Gly Phe Glu Asn Pro Asp Thr Phe Asp He 
325 330 * 335 



Glu Arg Gly Ala Arg His His Val Ala Phe Gly Phe Gly Val His Gin 
340 345 350 



Cys Leu Gly Gin Asn Leu Ala Arg Leu Glu Leu Gin He Val Phe Asp 
355 360 365 



Thr Leu Phe Arg Arg Val Pro Gly He Arg He Ala Val Pro Val Asp 
370 375 380 



Glu Leu Pro Phe Lys His Asp Ser Thr He Tyr Gly Leu His Ala Leu 
385 390 395 400 



Pro Val Thr Trp 



<210> 34 

<211> 1215 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Synthetic 
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<400> 34 
atgaccgacg 



tcgaggaaac 



caccgcgacc ttgccactgg cccgcaaatg cccgttttca 



60 



ccaccgcccg 



aatacgagcg 



gctccgccgg gaaagtccgg tttcccgggt cggtctcccc 



120 



tccggtcaaa ccgcttgggc gctcacccgg ctcgaagaca tccgcgaaat gctgagcagt 180 
ccgcatttca gctccgaccg gcagagtccg tcgttcccgc tgatggtggc gcggcagatc 240 



aaggccaggc 


gtgacgtcgt 


cggggaattc accgtcaagc gcatgaaagc gcttcagcca 


360 


cgtattcagc 


agatcgtcga 


cgagcatatc gacgccctgc tcgccggccc caaacccgcc 


420 


gatctcgtcc 


aggcgctttc 


cctgccggtt ccgtccttgg tgatctgcga actgctcggt 


480 


gtcccctatt 


cggaccacga 


gttcttccag tcctgcagtt cccggatgct cagccgggaa 


540 


crtcacccrccQ 


aagaacggat 


gaccgcgttc gagtcgctcg agaactatct cgacgaactc 


600 


gtcacgaaga 


aggaggcgaa 


cgccaccgag gacgacctcc tcggccgcca gatcctgaag 


660 


cagcgcgaat 


ccggcgaagc 


cgaccacggc gaactggtcg gtctggcggc gctcctgctc 


720 


atcgcggggc 


acgagactac 


ggcgaacatg atctcgctcg gcacggtgac cctgctggag 


780 


aaccccgatc 


agctggcgaa 


gatcaaggcg gatccgggca agaccctcgc cgcgatcgag 


840 


gaactcctgc 


ggatcttcac 


catcgcggag acggcgacct cacgcttcgc cacggcggac 


900 


gtcgagatcg 


gcggcacgct 


catccgcgcg ggtgaaggcg tcgtcggcct gagcaacgcg 


960 


ggcaaccacg 


atccggacgg 


cttcgagaac ccggacacct tcgacatcga acgcggcgcg 


1020 


cggcatcacg 


tcgccttcgg 


attcggtgtg caccaatgcc tcggccagaa cttggcgagg 


1080 


ttggaactcc 


agatcgtgtt 


cgatacgttg ttccggcgag tgccgggcat ccggatcgcc 


1140 


gtaccggtcg 


acgaactgcc 


gttcaagcac gattcgacga tctacggcct ccacgccctg 


1200 


ccggtcacct 


ggtag 




1215 



<210> 35 
<211> 404 
<212> PRT 

<213> Artificial sequence 
<220> 

<223> Synthetic 
<400> 35 

Met Thr Asp Val Glu Glu Thr Thr Ala Thr Leu Pro Leu Ala Arg Lys 
15 10 15 



cggcgcgagg 



acaagccgtt 



ccgcccgtcc ctcgtcgcga tggacccgcc ggaacacggc 



300 
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Cys Pro Phe Ser Pro Pro Pro Glu Tyr Glu Arg Leu Arg Arg Glu Ser 
20 25 30 



Pro Val Ser Arg Val Gly Leu Pro Ser Gly Gin Thr Ala Trp Ala Leu 
35 40 45 



Thr Arg Leu Glu Asp lie Arg Glu Met Leu Ser Ser Pro His Phe Ser 
50 55 60 



Ser Asp Arg Gin Ser Pro Ser Phe Pro Leu Met Val Ala Arg Gin lie 
65 70 75 ~ 80 



Arg Arg Glu Asp Lys Pro Phe Arg Pro Ser Leu Val Ala Met Asp Pro 
85 90 95 



Pro Glu His Gly Lys Ala Arg Arg Asp Val Val Gly Glu Phe Thr Val 
100 105 110 



Lys Arg Met Lys Ala Leu Gin Pro Arg He Gin Gin He Val Asp Glu 
115 120 125 



His lie Asp Ala Leu Leu Ala Gly Pro Lys Pro Ala Asp Leu Val Gin 
130 135 ' 140 



Ala Leu Ser Leu Pro Val Pro Ser Leu Val He Cys Glu Leu Leu Gly 
145 150 155 160 



Val Pro Tyr Ser Asp His Glu Phe Phe Gin Ser Cys Ser Ser Arg Met 
165 170 175 



Leu Ser Arg Glu Val Thr Ala Glu Glu Arg Met Thr Ala Phe Glu Ser 
180 185 190 



Leu Glu Asn Tyr Leu Asp Glu Leu Val Thr Lys Lys Glu Ala Asn Ala 
195 200 205 



Thr Glu Asp Asp Leu Leu Gly Arg Gin He Leu Lys Gin Arg Glu Ser 
210 215 220 



Gly Glu Ala Asp His Gly Glu Leu Val Gly Leu Ala Ala Leu Leu Leu 
225 230 235 240 
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He Ala Gly His Glu Thr Thr Ala Asn Met He Ser Leu Gly Thr Val 
245 250 255 



Thr Leu Leu Glu Asn Pro Asp Gin Leu Ala Lys He Lys Ala Asp Pro 
260 265 270 



Gly Lys Thr Leu Ala Ala He Glu Glu Leu Leu Arg He Phe Thr He 
275 280 285 



Ala Glu Thr Ala Thr Ser Arg Phe Ala Thr Ala Asp Val Glu He Gly 
290 295 300 



Gly Thr Leu He Arg Ala Gly Glu Gly Val Val Gly Leu Ser Asn Ala 
305 310 315 320 



Gly Asn His Asp Pro Asp Gly Phe Glu Asn Pro Asp Thr Phe Asp He 
325 330 335 



Glu Arg Gly Ala Arg His His Val Ala Phe Gly Phe Gly Val His Gin 
340 345 350 



Cys Leu Gly Gin Asn Leu Ala Arg Leu Glu Leu Gin He Val Phe Asp 
355 360 365 



Thr Leu Phe Arg Arg Val Pro Gly He Arg He Ala Val Pro Val Asp 
370 375 380 



Glu Leu Pro Phe Lys His Asp Ser Thr He Tyr Gly Leu His Ala Leu 
385 390 395 400 



Pro Val Thr Trp 



<210> 36 
<211> 1104 
<212> DNA 

<213> Amycolatopsis orientalis 
<400> 36 

gcgaccttgc cgctggcccg caaatgcccg ttttcaccgc cgcccgaata cgagcggctt 60 
cgccgggaaa gtccggtttc ccgggtcggt ctcccgtccg gtcaaaccgc ttgggcgctc 120 
acccggctcg aggacatccg cgaaatgctg agcagtccgc atttcagctc cgaccggcag 180 
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agtccgtcgt 


tcccgctgat 


ggtggcccgg cagatccggc gcgaggacaa gccgttccgc 


240 


ccgtccctca 


tcgcgatgga 


eccgccggaa cacagcaagg ccaggcgtga cgtcgtcggg 


300 


gaattcaccg 


tcaagcgcat 


gaaagcgctt cagccgcgta ttcagcagat cgtcgacgag 


360 


catatcgacg 


ccatgctcgc 


cggccccaaa cccgccgatc tcgtccaggc gctttccctg 


420 


ccggttccgt 


ccttggtgat 


ctgcgaactg ctcggtgtcc cctattcgga ccacgagttc 


480 


ttccagtcct 


gcagttcccg 


gatgctcagc cgggaagtca ccgccgaaga acggatgacc 


540 


gcgttcgagt 


cgctcgagaa 


ctatctcgac gaactcgtca cgaagaagga ggcgaacgcc 


600 


accgaggacg 


acctcctcgg 


ccgccagatc ctgaagcagc gcgaaacggg cgaagccgac 


660 


cacggcgaac 


tcgtcgggct 


ggcgttcctg ctgctcatcg cgggacacga gacgacggcg 


720 


aacatgatct 


cgctcggcac 


ggcgaccctg ctggagaacc ccgaccagct ggcgaagatc 


780 


aaggccgatc 


cgggcaagac 


cctcgccgcg atcgaggagc tcctgcgggt cttcaccatc 


840 


gcggagacgg 


cgacctcacg 


cttcgccacg gcggacgtcg agatcggcgg cacgctcatc 


900 


cgcgcgggtg 


aaggcgtcgt 


cggcctgagc aacgcgggca accacgatcc ggaaggcttc 


960 


gagaacccgg 


acgccttcga 


catcgaacgc ggcgcgcggc accacgtcgc cttcggattc 


1020 


ggtgtgcacc 


aatgcctcgg 


ccagaacttg gcgaggttgg aactccagat cgtgttcgat 


1080 


acgttgttcc 


ggcgagtgcc 


gggc 


1104 



<210> 37 
<211> 1103 
<212> DNA 

<213> Amycolatopsis orientalis 
<400> 37 

gaccttgccg ctggcccgga aatgcccgtt ttcgccgccg cccgaatacg aacggcttcg 60 
ccgggaaagt ccggtttccc gggtcggtct cccgtccggt caaacggctt gggcgctcac 120 
ccggctcgaa gacatccgcg aaatgctgag cagcccgcat ttcagttccg accggcagag 180 
cccgtcgttc ccgctgatgg tcgcgcggca gatccgccgc gaggacaagc cgttccgccc 240 
ctccctcatc gcgatggatc cgccggaaca cagccgggcc aggcgtgacg tcgtcgggga 300 
attcaccgtc aagcggatga aggcgctcca gccgcgaatt cagcagatcg tcgacgaaca 360 
tctcgacgcc ctgctcgcgg gccccaaacc cgccgatctc gtccaggcgc tttccctgcc 420 
cgttccctcg ctggtgatct gcgaactgct cggcgtcccc tattcggacc acgagttctt 480 
ccagtcctgc agttccagga tgctcagccg ggaggtcacc gccgaagaac ggatgaccgc 540 
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gttcgagcag ctcgaaaact atctcgacga actggtcacc aagaaggagg cgaacgccac 600 

cgaggacgac ctcctcggcc gtcagatcct gaaacagcgg gaaacgggcg aggccgacca 660 

cggtgaactc gtcgggctgg cgttcctgct gctcatcgcc ggacacgaga ccacggcgaa 720 

catgatctcg ctcggcacgg tgaccctgct ggagaatccc gatcagctcg cgaagatcaa 780 

ggcagacccc ggcaagaccc tcgccgccat cgaggaactc ctgcgggtct tcacgatcgc 840 

ggaaacggcg acctcacgct tcgccacggc ggacgtcgag atcggcggaa cgctgatccg 900 

cgcgggggaa ggggtggtgg gcctgagcaa cgcgggcaac cacgatccgg acggcttcga 960 

gaaeccggac accttcgaca tcgaacgcgg cgcgcggcat cacgtcgcgt tcggattcgg 1020 

ggtgcaccag tgtctcggcc agaacttggc gaggttggaa ctccagatcg tcttcgatac 1080 

gttgttccgg cgagtgccgg gcc 1103 

<210> 38 
<211> 817 
<212> DNA 

<213> Araycolatopsis orientalis 
<400> 38 

cttcacccgc gcggatgagc gtgccgccga tctcgacgtc cgccgtggcg aagcgtgagg 60 

tcgccgtctc cgcgatggtg aagatccgca ggagttcctc gatcgcggcg agggtcttgc 120 

ccggatccgc cttgatcttc gccagctgat cggggttctc cagcagggtc accgtgccga 180 

gcgagatcat gttcgccgta gtctcgtgcc ccgcgatgag caggaggaac gccagaccga 240 

ccagttcgcc gtggtcggct tcgccggatt cgcgctgctt caggatctgg cggccgagga 300 

ggtcgtcctc ggtggcgttc gcctccttct tcgtgacgag ttcgtcgaga tagttctcga 360 

gcgactcgaa cgcggtcatc cgttcttcgg cggtgacttc ccggctgagc atccgggaac 420 

tgcaggactg gaagaactcg tggtccgaat aggggacacc gagcagttcg cagatcacca 480 

aggacggaac cggcagggaa agcgcctgga cgagatcggc gggtttgggg ccggcgagca 540 

gggcgtcgab atgctcgtcg acgatctgct gaatacgtgg ctgaagcgct ttcatgcgct 600 

tgacggtgaa ttccccgacg acgtcacgcc tggccttgcc gtgttccggc gggtccatcg 660 

cgatgaggga cgggcggaac ggcttgtcct cgcgccggat ctgccgcgcc accatcagcg 720 

ggaacgacgg actctgccgg tcggagctga aatgcggact gctcagcatt tcgcggatgt 780 

cttcgagccg ggtgagcgcc caagcggttt gaccgga 817 

<210> 39 
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<211> 1105 
<212> DNA 

<213> Amycolatopsis orientalis 
<400> 39 

ccgcgacctt gccgctggcc cgcaaatgcc cgttttcacc gccgcccgaa tacgagcggc 60 

ttcgccggga aagtccggtt tcccgggtcg gtctcccgtc cggtcaaacc gcttgggcgc 120 

tcacccggct cgaggacatc cgcgaaatgc tgagcagtcc gcatttcagc tccgaccggc 180 

agagtccgtc gttcccgctg atggtggccc ggcagatccg gcgcgaggac aagccgttcc 240 

gcccgtccct catctcgatg gacccgccgg aacacagcaa ggccaggcgt gacgtcgtcg 300 

gggaattcac cgtcaagcgc atgaaagcgc ttcagccgcg tattcagcag atcgtcgacg 360 

agcatatcga cgccctgctc gccggcccca aacccgccga tctcgtccag gcgctttccc 420 

tgccggttcc gtccttggtg atctgcgaac tgctcggtgt cccctattcg gaccacgagt 480 

tcttccagtc ctgcagttcc cggatgctca gccgggaagt caccgccgaa gaacggatga 540 

ccgcgttcga gtcgctcgag aactatctcg acgaactcgt cacgaagaag gaggcgaacg 600 

ccaccgagga cgacctcctc ggccgccaga tcctgaagca gcgcgaaacg ggcgaagccg 660 

accacggcga actggtcggg ctggcgttcc tcctgctcat cgcgggacac gagacgacgg 720 

cgaacatgat ctcgctcggc acggcgaccc tgctggagaa ccccgaccag ctggcgaaga 780 

tcaaggccga tccgggcaag accctcgccg cgatcgagga gctcctgcgg gtcttcacca 840 

tcgcggagac ggcgacctca cgcttcgcca cggcggacgt cgagatcggc ggcacgctca 900 

tccgcgcggg tgaaggcgtc gtcggcctga gtaacgcggg caaccacgat ccggaaggct 960 

tcgagaaccc ggacgccttc gacatcgaac gcggcgcgcg gcaccacgtc gccttcggat 1020 

tcggtgtgca ccaatgcctc ggccagaact tggcgaggtt ggaactccag atcgtgttcg 1080 

atacgttgtt ccggcgagtg ccggg 1105 

<210> 40 
<211> 1304 
<212> DNA 

<213> Amycolatopsis orientalis 
<400> 40 

ccttgccact ggcccgcaaa tgcccgtttt caccaccgcc cgaatacgag cggctccgcc 60 

gggaaagtcc ggtttcccgg gtcggtctcc cctccggtca aaccgcfctgg gcgctcaccc 120 

ggctcgaaga catccgcgaa atgctgagca gtccgcattt cagctccgac cggcagagtc 180 

cgtcgttccc gctgatggtg gcgcggcaga tccggcgcga ggacaagccg ttccgcccgt 240 



21 



WO 2004/061116 



PCT/US2003/034082 





gauggacucg 


ccggaacacg gcaaggccag gcgcgacgcc gncggggaat 


300 


LLULUy ULaa 


ycycauyacta 


gcgcttcagc cacgfcattca gcagatcgtc gacgagcata 


360 


LLLJaLyLLLL 


gcucgccygc 


cccaaacccg ccgafcctcgfc ccaggcgctt tccctgccgg 


420 


LLLL^UUCLU 


gg t ga u c ugc 


gaacugctcg gtgtccccta txcggaccac gagttcctcc 


480 


aguCCCycay 


ctcccggatg 


ctcagccggg aagtcaccgc cgaagaacgg atgaccgcgt 


540 


ccgagccgcu 


cgagaacfcat 


ctcgacgaac tcgtcacgaa gaaggaggcg aacgccaccg 


600 


agg&cgacct 


cctcggccgc 


cagatcctga agcagcgcga atccggcgaa gccgaccacg 


660 


gcgaactggt 


cggtctggcg 


ttcctcctgc tcatcgcggg gcacgagact acggcgaaca 


720 


tgatctcgct 


cggcacggtg 


accctgctgg agaaccccga tcagctggcg aagatcaagg 


780 


cggatccggg 


caagaccctc 


gccgcgatcg aggaactcct gcggatcttc accatcgcgg 


840 


agacggcgac 


ctcacgcttc 


gccacggcgg acgtcgagat cggcggcacg ctcatccgcg 


900 


cgggtgaagg 


cgtcgtcggc 


ctgagcaacg cgggcaacca cgatccggac ggcttcgaga 


960 


acccggacac 


cttcgacatc 


gaacgcggcg cgcggcatca cgtcgccttc ggattcggtg 


1020 


tgcaccaatg 


cctcggccag 


aacttggcga ggttggaact ccagatcgtg ttcgatacgt 


1080 


tgttccggcg 


agtgccgggc 


atccggatcg ccgtaccggt cgacgaactg ccgttcaagc 


1140 


acgattcgac 


gatctacggc 


ctccgcgccc tgccggtcac ctggtaggag gagccatgaa 


1200 


gatcatcgcg 


gacaccggga 


agtgcgtggg ggcgggccag tgcgtgctca ccgatcccga 


1260 


tctgttcgac 


cagagcgagg 


acgacgggac ggtcctcctg ctga 


1304 



<210> 41 
<211> 825 
<212> DNA 

<213> Amycolatopsis orientalis 
<400> 41 

ctccggtcaa accgcttggg cgctcacccg gctcgaagac atccgcgaaa tgctgagcag 60 
tccgcatttc agctccgacc ggcagaatcc gtcgttcccg ctgatggtgg cgcggcagat 120 
ccggcgcgag gacaagccgt tccgcccgtc cctcatcgcg atggacccgc cggaacacag 180 
caaggccagg cgtgacgtcg tcggggaatt caccgtcaag cgcatgaaag cgcttcagcc 240 
gcgtattcag cagatcgtcg acgagcatat cgacgccctg ctcgccggcc ccaaacccgc 300 
cgatctcgtc caggcgcttt ccctgccggt tccgtccttg gtgatctgcg aactgctcgg 360 
tgtcccctat tcggaccacg agttcttcca gtcctgcagt tcccggatgc tcagccggga 420 
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agtcaccgcc gaagaacgga tgaccgcgtt cgagtcgctc gagaactatc tcgacgaact 480 

cgtcacgaag aaggaggcga acgccaccga ggacgacctc ctcggccgcc agatcctgaa 540 

gcagcgggaa acgggcgagg ccgaccacgg cgaactcgtc gggctggcgt tcctgctgct 600 

catcgccggg cacgagacga cggcgaacat gatctcgctc ggcacggcga ccctgctgga 660 

gaaccccgac cagctggcga agatcaaggc ggatccgggc aagaccctcg ccgcgatcga 720 

ggaactgctg cgcgtcttca cgatcgcgga gacggcgacc tcacgcttcg ccacggcgga 780 

cgtcgagatc ggcggcacgc tcatccgcgc gggtgaaggc gtcgt 825 

<210> 42 
<211> 1103 
<212> DNA 

<213> Amycolatopsis orientalis 
<400> 42 

gcgaccttgc cactggcccg caaatgcccg ttttcaccac cgcccgaata cgagcggctc 60 

cgccgggaaa gtccggtttc ccgggtcggt ctcccctccg gtcaaaccgc ttgggcgctc 120 

acccggctcg aagacatccg cgaaatgctg agcagtccgc atttcagctc cgaccggcag 180 

agtccgtcgt tcccgctgat ggtggcgcgg cagatccggc gcgaggacaa gccgttccgc 240 

ccgtccctca tcgcgatgga cccgccggaa cacggcaagg ccaggcgtga cgtcgtcggg 300 

gaattcaccg tcaagcgcat gaaagcgctt cagccacgta ttcagcagat cgtcgacgag 360 

catafccgacg ccctgctcgc cggccccaaa cccgccgatc tcgtccaggc gctttccctg 420 

ccggttccgt ccttggtgat ctgcgaactg ctcggtgtcc cctathcgga ccacgagt'tc 480 

ttccagtcct gcagttcccg gatgctcagc cgggaagtca ccgccgaaga acggatgacc . 540 

gcgttcgagt cgctcgagaa ctatctcgac gaactcgtca cgaagaagga ggcgaacgcc 600 

accgaggacg acctcctcgg ccgccagatc ctgaagcagc gcgaatccgg cgaagccgac 660 

cacggcgaac tggtcggtct ggcgttcctc ctgctcatcg cggggcacga gactacggcg 720 

aacatgatct cgctcggcac ggtgaccctg ctggagaacc ccgatcagct ggcgaagatc 780 

aaggcggatc cgggcaagac cctcgccgcg atcgaggaac tcctgcggat cttcaccatc 840 

gcggagacgg cgacctcacg cttcgccacg gcggacgtcg agatcggcgg cacgctcatc 900 

cgcgcgggtg aaggcgtcgt cggcctgagc aacgcgggca accacgatcc ggacggcttc 960 

gagaacccgg acaccttcga catcgaacgc ggcgcgcggc atcacgtcgc cttcggattc 1020 

ggtgtgcacc aatgcctcgg ccagaacttg gcgaggttgg aactccagat cgtgttcgat 1080 
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acgttgttcc ggcgagtgcc ggg X103 



<210> 43 

<211> 402 

<212> PRT 

<213> Amycolatopsis orientalis 

<400> 43 

Met Thr Asp Val Glu Glu Thr Thr Ala Thr Leu Pro Leu Ala Arg Lys 
1 5 io 15 



Cys Pro Phe Ser Pro Pro Pro Glu Tyr Glu Arg Leu Arg Arg Glu Ser 
20 25 30 



Pro Val Ser Arg Val Gly Leu Pro Ser Gly Gin Thr Ala Trp Ala Leu 
35 40 45 



Thr Arg Leu Glu Asp He Arg Glu Met Leu Ser Ser Pro His Phe Ser 
50 55 60 



Ser Asp Arg Gin Ser Pro Ser Phe Pro Leu Met Val Ala Arg Gin He 
65 70 75 80 



Arg Arg Glu Asp Lys Pro Phe Arg Pro Ser Leu He Ala Met Asp Pro 
85 90 95 



Pro Glu His Ser Lys Ala Arg Arg Asp Val Val Gly Glu Phe Thr Val 
100 105 110 



Lys Arg Met Lys Ala Leu Gin Pro Arg He Gin Gin He Val Asp Glu 
115 120 125 



His He Asp Ala Met Leu Ala Gly Pro Lys Pro Ala Asp Leu Val Gin 
130 135 140 



Ala Leu Ser Leu Pro Val Pro Ser Leu Val He Cys Glu Leu Leu Gly 
145 150 155 160 



Val Pro Tyr Ser Asp His Glu Phe Phe Gin Ser Cys Ser Ser Arg Met 
165 170 175 



Leu Ser Arg Glu Val Thr Ala Glu Glu Arg Met Thr Ala Phe Glu Ser 
180 185 190 
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Leu Glu Asn Tyr Leu Asp Glu Leu Val Thr Lys Lys Glu Ala Asn Ala 
195 200 205 



Thr Glu Asp Asp Leu Leu Gly Arg Gin lie Leu Lys Gin Arg Glu Thr 
210 215 220 



Gly Glu Ala Asp His Gly Glu Leu Val Gly Leu Ala Phe Leu Leu Leu 
225 230 235 240 



He Ala Gly His Glu Thr Thr Ala Asn Met He Ser Leu Gly Thr Ala 
245 250 255 



Thr Leu Leu Glu Asn Pro Asp Gin Leu Ala Lys He Lys Ala Asp Pro 
260 265 270 



Gly Lys Thr Leu Ala Ala He Glu Glu Leu Leu Arg Val Phe Thr He 
275 280 285 



Ala Glu Thr Ala Thr Ser Arg Phe Ala Thr Ala Asp Val Glu He Gly 
290 295 300 



Gly Thr Leu He Arg Ala Gly Glu Gly Val Val Gly Leu Ser Asn Ala 
305 310 315 320 



Gly Asn His Asp Pro Glu Gly Phe Glu Asn Pro Asp Ala Phe Asp He 
325 330 335 



Glu Arg Gly Ala Arg His His Val Ala Phe Gly Phe Gly Val His Gin 
340 345 350 



Cys Leu Gly Gin Asn Leu Ala Arg Leu Glu Leu Gin He Val Phe Asp 
355 360 365 



Thr Leu Phe Arg Arg Val Pro Gly He Arg He Ala Val Pro Val Asp 
370 375 380 



Glu Leu Pro Phe Lys His Asp Ser Thr He Tyr Gly Leu His Ala Leu 
385 390 395 400 



Pro Val 
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<210> 44 

<211> 367 

<212> PRT 

<213> Amycolatopsis orientalis 

<400> 44 

Thr Leu Pro Leu Ala Arg Lys Cys Pro Phe Ser Pro Pro Pro Glu Tyr 
1 5 10 15 



Glu Arg Leu Arg Arg Glu Ser Pro Val Ser Arg Val Gly Leu Pro Ser 
20 25 30 



Gly Gin Thr Ala Trp Ala Leu Thr Arg Leu Glu Asp lie Arg Glu Met 
35 40 45 



Leu Ser Ser Pro His Phe Ser Ser Asp Arg Gin Ser Pro Ser Phe Pro 
50 55 60 



Leu Met Val Ala Arg Gin lie Arg Arg Glu Asp Lys Pro Phe Arg Pro 
65 70 75 " 80 



Ser Leu lie Ala Met Asp Pro Pro Glu His Ser Arg Ala Arg Arg Asp 
85 90 95 



Val Val Gly Glu Phe Thr Val Lys Arg Met Lys Ala Leu Gin Pro Arg 
100 105 110 



lie Gin Gin lie Val Asp Glu His Leu Asp Ala Leu Leu Ala Gly Pro 
115 120 125 



Lys Pro Ala Asp Leu Val Gin Ala Leu Ser Leu Pro Val Pro Ser Leu 
130 135 140 



Val lie Cys Glu Leu Leu Gly Val Pro Tyr Ser Asp His Glu Phe Phe 
145 150 155 160 



Gin Ser Cys Ser Ser Arg Met Leu Ser Arg Glu Val Thr Ala Glu Glu 
165 170 175 



Arg Met Thr Ala Phe Glu Gin Leu Glu Asn Tyr Leu Asp Glu Leu Val 
180 185 190 



Thr Lys Lys Glu Ala Asn Ala Thr Glu Asp Asp Leu Leu Gly Arg Gin 
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195 200 205 



lie Leu Lys Gin Arg Glu Thr Gly Glu Ala Asp His Gly Glu Leu Val 
210 215 220 



Gly Leu Ala Phe Leu Leu Leu He Ala Gly His Glu Thr Thr Ala Asn 
225 230 235 240 



Met lie Ser Leu Gly Thr Val Thr Leu Leu Glu Asn Pro Asp Gin Leu 
245 250 *" 255 



Ala Lys He Lys Ala Asp Pro Gly Lys Thr Leu Ala Ala He Glu Glu 
260 265 270 



Leu Leu Arg Val Phe Thr He Ala Glu Thr Ala Thr Ser Arg Phe Ala 
275 280 285 



Thr Ala Asp Val Glu He Gly Gly Thr Leu He Arg Ala Gly Glu Gly 
290 295 300 



Val Val Gly Leu Ser Asn Ala Gly Asn His Asp Pro Asp Gly Phe Glu 
305 310 315 320 



Asn Pro Asp Thr Phe Asp He Glu Arg Gly Ala Arg His His Val Ala 
325 330 335 



Phe Gly Phe Gly Val His Gin Cys Leu Gly Gin Asn Leu Ala Arg Leu 
340 345 350 



Glu Leu Gin He Val Phe Asp Thr Leu Phe Arg Arg Val Pro Gly 
355 360 365 



<210> 45 

<211> 272 

<212> PRT 

<213> Amycolatopsis orientalis 

<400> 45 

Ser Gly Gin Thr Ala Trp Ala Leu Thr Arg Leu Glu Asp He Arg Glu 
1 5 10 15 



Met Leu Ser Ser Pro His Phe Ser Ser Asp Arg Gin Ser Pro Ser Phe 
20 25 " 30 
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Pro Leu Met Val Ala Arg Gin lie Arg Arg Glu Asp Lys Pro Phe Arg 
35 40 45 



Pro Ser Leu lie Ala Met Asp Pro Pro Glu His Gly Lys Ala Arg Arg 
50 55 60 



Asp Val Val Gly Glu Phe Thr Val Lys Arg Met Lys Ala Leu Gin Pro 
65 70 75 80 



Arg lie Gin Gin lie Val Asp Glu His He Asp Ala Leu Leu Ala Gly 
85 90 95 



Pro Lys Pro Ala Asp Leu Val Gin Ala Leu Ser Leu Pro Val Pro Ser 
100 105 110 



Leu Val He Cys Glu Leu Leu Gly Val Pro Tyr Ser Asp His Glu Phe 
115 120 125 



Phe Gin Ser Cys Ser Ser Arg Met Leu Ser Arg Glu Val Thr Ala Glu 
130 135 140 



Glu Arg Met Thr Ala Phe Glu Ser Leu Glu Asn Tyr Leu Asp Glu Leu 
145 150 155 160 



Val Thr Lys Lys Glu Ala Asn Ala Thr Glu Asp Asp Leu Leu Gly Arg 
165 170 175 



Gin He Leu Lys Gin Arg Glu Ser Gly Glu Ala Asp His Gly Glu Leu 
180 185 190 

Val Gly Leu Ala Phe Leu Leu Leu He Ala Gly His Glu Thr Thr Ala 
195 200 205 



Asn Met He Ser Leu Gly Thr Val Thr Leu Leu Glu Asn Pro Asp Gin 
210 215 220 



Leu Ala Lys He Lys Ala Asp Pro Gly Lys Thr Leu Ala Ala He Glu 
225 230 235 240 



Glu Leu Leu Arg He Phe Thr He Ala Glu Thr Ala Thr Ser Arg Phe 
245 250 255 
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Ala Thr Ala Asp Val Glu lie Gly Gly Thr Leu He Arg Ala Gly Glu 
260 265 270 



<210> 46 
<211> 367 
<212> PRT 

<213> Amycolatopsis orientalis 
<400> 46 

Ala Thr Leu Pro Leu Ala Arg Lys Cys Pro Phe Ser Pro Pro Pro Glu 
15 10 15 



Tyr Glu Arg Leu Arg Arg Glu Ser Pro Val Ser Arg Val Gly Leu Pro 
20 25 30 



Ser Gly Gin Thr Ala Trp Ala Leu Thr Arg Leu Glu Asp He Arg Glu 
' 35 40 45 



Met Leu Ser Ser Pro His Phe Ser Ser Asp Arg Gin Ser Pro Ser Phe 
50 55 ~ ' 60 



Pro Leu Met Val Ala Arg Gin He Arg Arg Glu Asp Lys Pro Phe Arg 
65 70 75 80 



Pro Ser Leu He Ser Met Asp Pro Pro Glu His Ser Lys Ala Arg Arg 
85 90 95 



Asp Val Val Gly Glu Phe Thr Val Lys Arg Met Lys Ala Leu Gin Pro 
100 105 ™ 110 



Arg He Gin Gin He Val Asp Glu His He Asp Ala Leu Leu Ala Gly 
115 120 125 



Pro Lys Pro Ala Asp Leu Val Gin Ala Leu Ser Leu Pro Val Pro Ser 
130 135 140 



Leu Val He Cys Glu Leu Leu Gly Val Pro Tyr Ser Asp His Glu Phe 
145 150 155 160 



Phe Gin Ser Cys Ser Ser Arg Met Leu Ser Arg Glu Val Thr Ala Glu 
165 170 " 175 



Glu Arg Met Thr Ala Phe Glu Ser Leu Glu Asn Tyr Leu Asp Glu Leu 
180 185 190 
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Val Thr Lys Lys Glu Ala Asn Ala Thr Glu Asp Asp Leu Leu Gly Arg 
195 200 ~ 205 



Gin lie Leu Lys Gin Arg Glu Thr Gly Glu Ala Asp His Gly Glu Leu 
210 215 220 



Val Gly Leu Ala Phe Leu Leu Leu He Ala Gly His Glu Thr Thr Ala 
225 230 235 240 



Asn Met He Ser Leu Gly Thr Ala Thr Leu Leu Glu Asn Pro Asp Gin 
245 250 255 



Leu Ala Lys He Lys Ala Asp Pro Gly Lys Thr Leu Ala Ala He Glu 
260 265 270 



Glu Leu Leu Arg Val Phe Thr He Ala Glu Thr Ala Thr Ser Arg Phe 
275 280 285 



Ala Thr Ala Asp Val Glu He Gly Gly Thr Leu He Arg Ala Gly Glu 
290 295 300 



Gly Val Val Gly Leu Ser Asn Ala Gly Asn His Asp Pro Glu Gly Phe 
305 310 315 ~ 320 



Glu Asn Pro Asp Ala Phe Asp He Glu Arg Gly Ala Arg His His Val 
325 330 335 



Ala Phe Gly Phe Gly Val His Gin Cys Leu Gly Gin Asn Leu Ala Arg 
340 345 350 



Leu Glu Leu Gin He Val Phe Asp Thr Leu Phe Arg Arg Val Pro 
355 360 365 



<210> 47 
<211> 394 
<212> PRT 

<213> Amycolatopsis orientalis 
<400> 47 

Leu Pro Leu Ala Arg Lys Cys Pro Phe Ser Pro Pro Pro Glu Tyr Glu 
15 io 15 
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Arg Ijeu Arg Arg Glu Ser Pro Val Ser Arg Val Gly Leu Pro Ser Gly 
20 25 30 



Gin Thr Ala Trp Ala Leu Thr Arg Leu Glu Asp lie Arg Glu Met Leu 
35 40 45 



Ser Ser Pro His Phe Ser Ser Asp Arg Gin Ser Pro Ser Phe Pro Leu 
50 55 60 



Met Val Ala Arg Gin He Arg Arg Glu Asp Lys Pro Phe Arg Pro Ser 
65 70 75 80 



Leu He Ala Met Asp Pro Pro Glu His Gly Lys Ala Arg Arg Asp Val 
85 90 95 



Val Gly Glu Phe Thr Val Lys Arg Met Lys Ala Leu Gin Pro Arg He 
100 105 110 



Gin Gin He Val Asp Glu His He Asp Ala Leu Leu Ala Gly Pro Lys 
115 120 125 



Pro Ala Asp Leu Val Gin Ala Leu Ser Leu Pro Val Pro Ser Leu Val 
130 135 140 



He Cys Glu Leu Leu Gly Val Pro Tyr Ser Asp His Glu Phe Phe Gin 
145 150 155 160 



Ser Cys Ser Ser Arg Met Leu Ser Arg Glu Val Thr Ala Glu Glu Arg 
165 170 175 



Met Thr Ala Phe Glu Ser Leu Glu Asn Tyr Leu Asp Glu Leu Val Thr 
180 185 190 



Lys Lys Glu Ala Asn Ala Thr Glu Asp Asp Leu Leu Gly Arg Gin He 
195 200 205 



Leu Lys Gin Arg Glu Ser Gly Glu Ala Asp His Gly Glu Leu Val Gly 
210 215 220 



Leu Ala Phe Leu Leu Leu He Ala Gly His Glu Thr Thr Ala Asn Met 
225 230 235 240 



He Ser Leu Gly Thr Val Thr Leu Leu Glu Asn Pro Asp Gin Leu Ala 
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245 250 255 



Lys He Lys Ala Asp Pro Gly Lys Thr Leu Ala Ala lie Glu Glu Leu 
260 265 270 



Leu Arg He Phe Thr He Ala Glu Thr Ala Thr Ser Arg Phe Ala Thr 
275 280 285 



Ala Asp Val Glu He Gly Gly Thr Leu He Arg Ala Gly Glu Gly Val 
290 295 300 



Val Gly Leu Ser Asn Ala Gly Asn His Asp Pro Asp Gly Phe Glu Asn 
305 310 315 320 



Pro Asp Thr Phe Asp He Glu Arg Gly Ala Arg His His Val Ala Phe 
325 330 335 



Gly Phe Gly Val His Gin Cys Leu Gly Gin Asn Leu Ala Arg Leu Glu 
340 345 350 



Leu Gin He Val Phe Asp Thr Leu Phe Arg Arg Val Pro Gly He Arg 
355 360 365 



He Ala Val Pro Val Asp Glu Leu Pro Phe Lys His Asp Ser Thr He 
370 375 380 



Tyr Gly Leu Arg Ala Leu Pro Val Thr Trp 



385 


390 


<210> 


48 


<211> 


274 


<212> 


PRT 


<213> 


Amycolatopsis oriental is 


<400> 


48 



Ser Gly Gin Thr Ala Trp Ala Leu Thr Arg Leu Glu Asp He Arg Glu 
1 5 10 15 



Met Leu Ser Ser Pro His Phe Ser Ser Asp Arg Gin Asn Pro Ser Phe 
20 25 30 



Pro Leu Met Val Ala Arg Gin He Arg Arg Glu Asp Lys Pro Phe Arg 
35 40 ** 45 
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Pro Ser Leu lie Ala Met Asp Pro Pro Glu His Ser Lys Ala Arg Arg 
50 55 60 



Asp Val Val Gly Glu Phe Thr Val Lys Arg Met Lys Ala Leu Gin Pro 
65 70 75 80 



Arg lie Gin Gin lie Val Asp Glu His He Asp Ala Leu Leu Ala Gly 
85 90 95 



Pro Lys Pro Ala Asp Leu Val Gin Ala Leu Ser Leu Pro Val Pro Ser 
100 105 HO 



Leu Val He Cys Glu Leu Leu Gly Val Pro Tyr Ser Asp His Glu Phe 
115 120 125 



Phe Gin Ser Cys Ser Ser Arg Met Leu Ser Arg Glu Val Thr Ala Glu 
130 135 140 



Glu Arg Met Thr Ala Phe Glu Ser Leu Glu Asn Tyr Leu Asp Glu Leu 
145 150 155 160 



Val Thr Lys Lys Glu Ala Asn Ala Thr Glu Asp Asp Leu Leu Gly Arg 
165 170 175 



Gin He Leu Lys Gin Arg Glu Thr Gly Glu Ala Asp His Gly Glu Leu 
180 185 190 



Val Gly Leu Ala Phe Leu Leu Leu He Ala Gly His Glu Thr Thr Ala 
195 200 ^ 205 



Asn Met He Ser Leu Gly Thr Ala Thr Leu Leu Glu Asn Pro Asp Gin 
210 215 220 



Leu Ala Lys He Lys Ala Asp Pro Gly Lys Thr Leu Ala Ala lie Glu 
225 230 " 235 240 



Glu Leu Leu Arg Val Phe Thr He Ala Glu Thr Ala Thr Ser Arg Phe 
245 250 255 



Ala Thr Ala Asp Val Glu He Gly Gly Thr Leu He Arg Ala Gly Glu 
260 265 270 
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Gly Val 



<210> 49 

<211> 367 

<212> PRT 

<213> Amycolatopsis orientalis 

<400> 49 



Ala Thr Leu Pro Leu Ala Arg Lys Cys Pro Phe Ser Pro Pro Pro Glu 
15 10 15 



Tyr Glu Arg Leu Arg Arg Glu Ser Pro Val Ser Arg Val Gly Leu Pro 
20 25 30 



Ser Gly Gin Thr Ala Trp Ala Leu Thr Arg Leu Glu Asp He Arg Glu 
35 40 45 



Met Leu Ser Ser Pro His Phe Ser Ser Asp Arg Gin Ser Pro Ser Phe 
50 55 60 



Pro Leu Met Val Ala Arg Gin He Arg Arg Glu Asp Lys Pro Phe Arg 
65 70 75 80 



Pro Ser Leu He Ala Met Asp Pro Pro Glu His Gly Lys Ala Arg Arg 
85 90 95 



Asp Val Val Gly Glu Phe Thr Val Lys Arg Met Lys Ala Leu Gin Pro 
100 105 110 



Arg He Gin Gin He Val Asp Glu His He Asp Ala Leu Leu Ala Gly 
115 120 125 



Pro Lys Pro Ala Asp Leu Val Gin Ala Leu Ser Leu Pro Val Pro Ser 
130 135 140 



Leu Val He Cys Glu Leu Leu Gly Val Pro Tyr Ser Asp His Glu Phe 
145 150 155 160 



Phe Gin Ser Cys Ser Ser Arg Met Leu Ser Arg Glu Val Thr Ala Glu 
165 170 175 



Glu Arg Met Thr Ala Phe Glu Ser Leu Glu Asn Tyr Leu Asp Glu Leu 
180 185 190 
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Val Thr Lys Lys Glu Ala Asn Ala Thr Glu Asp Asp Leu Leu Gly Arg 
195 200 205 



Gin He Leu Lys Gin Arg Glu Ser Gly Glu Ala Asp His Gly Glu Leu 
210 215 220 



Val Gly Leu Ala Phe Leu Leu Leu He Ala Gly His Glu Thr Thr Ala 
225 230 235 240 



Asn Met He Ser Leu Gly Thr Val Thr Leu Leu Glu Asn Pro Asp Gin 
245 250 255 



Leu Ala Lys He Lys Ala Asp Pro Gly Lys Thr Leu Ala Ala He Glu 
260 265 270 



Glu Leu Leu Arg lie Phe Thr He Ala Glu Thr Ala Thr Ser Arg Phe 
275 280 285 



Ala Thr Ala Asp Val Glu He Gly Gly Thr Leu He Arg Ala Gly Glu 
290 295 300 



Gly Val Val Gly Leu Ser Asn Ala Gly Asn His Asp Pro Asp Gly Phe 
305 310 315 320 



Glu Asn Pro Asp Thr Phe Asp He Glu Arg Gly Ala Arg His His Val 
325 330 335 



Ala Phe Gly Phe Gly Val His Gin Cys Leu Gly Gin Asn Leu Ala Arg 
340 345 350 



Leu Glu Leu Gin He Val Phe Asp Thr Leu Phe Arg Arg Val Pro 
355 360 365 



<210> 50 

<211> 25 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Synthetic 

<400> 50 

aggaaaccac cgcgaccttg ccact 25 
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<210> 51 

<211> 25 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Synthetic 

<400> 51 

accgaatccg aaggcgacgt gatgc 25 



<210> 


52 


<211> 


23 


<212> 


DNA 


<213> 


Artificial sequence 


<220> 




<223> 


Synthetic 


<400> 


52 


cggaatgaat ccatccgcat acg 


<210> 


53 


<211> 


23 


<212> 


DNA 


<213> 


Artificial sequence 


<220> 




<223> 


Synthetic 


<400> 


53 


tgatcttcat ggctcctcct acc 


<210> 


54 


<211> 


35 


<212> 


DNA 


<213> 


Artificial sequence 


<220> 




<223> 


Synthetic 


<220> 




<221> 


mis cofeature 


<222> 


(18) . . (20) 


<223> 


n=a, c, g or t 


<400> 


54 



gcgaagccga ccacggcnnn ctggtcggtc tggcg 35 



<210> 55 
<211> 35 
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<212> DNA 

<213> Artificial sequence 
<220> 

<223> Synthetic 
<220> 

<22l> misc_feature 

<222> (16).. (18) 

<223> n=a, c, g or t 



<400> 55 

cgccagaccg accagnnngc cgtggtcggc ttcgc 35 



<210> 


56 


<211> 


35 


<212> 


DMA 


<213> 


Artificial sequence 


<220> 




<223> 


Synthetic 


<220> 




<221> 


misc feature 


<222> 


(14) (14) 


<223> 


n=a, c, g or t 


<400> 


56 



ggtcggtctg gcgnysctcc tgctcatcgc ggggc 35 



<210> 57 

<211> 35 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Synthetic 



<220> 

<221> misc_feature 

<222> (22) . . (22) 

<223> n=a, c, g or t 



<400> 57 

gccccgcgat gagcaggags rncgccagac cgacc 35 

<210> 58 

<211> 35 

<212> DNA 

<213> Artificial sequence 
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<220> 

<223> Synthetic 
<220> 

<221> misc_feature 

<222> (17).. (17) 

<223> n=a, c r g or t 

<400> 58 

ggtcggtctg gcgttcnysc tgctcatcgc ggggc 

<210> 59 

<211> 35 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Synthetic 

<220> 
<221> 
<222> 
<223> 

<400> 59 

gccccgcgat gagcagsrng aacgccagac cgacc 



35 



35 



misc_feature 
(19).. (19) 
n=a / c, g or t 



<210> 60 
<211> 1215 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> Synthetic 
<400> 60 

atgaccgacg tcgaggaaac caccgcgacc ttgccactgg cccgcaaatg cccgttttca 60 
ccaccgcccg aatacgagcg gctccgccgg gaaagtccgg tttcccgggt cggtctcccc 120 
tccggtcaaa ccgcttgggc gctcacccgg ctcgaagaca tccgcgaaat gctgagcagt 180 
ccgcatttca gctccgacca gcagagtccg tcgttcccgc tgatggtggc gcggcagatc 240 
cggcgcgagg acaagccgtt ccgcccgtcc ctcgtcgcga tggacccgcc ggaacacggc 300 
aaggccaggc gtgacgtcgt cggggaattc accgtcaagc gcatgaaagc gcttcagcca 360 
cgtattcagc agatcgtcga cgagcatatc gacgccctgc tcgccggccc caaacccgcc 420 
gatctcgtcc aggcgctttc cctgccggtt ccgtccttgg tgatctgcga actgctcggt 480 
gtcccctatt cggaccacga gttcttccag tcctgcagtt cccggatgct cagccgggaa 540 
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gtcaccgccg 


aagaacggat 


gaccgcgttc gagtcgctcg agaactatct cgacgaactc 


600 


gtcacgaaga 


aggaggcgaa 


cgccaccgag gacgacctcc tcggccgcca gatcctgaag 


660 


cagcgcgaat 


ccggcgaagc 


cgaccacggc gaactggtcg gtctggcggc gctcctgctc 


720 


atcgcggggc 


acgagactac 


ggcgaacatg atctcgctcg gcacggtgac cctgctggag 


780 


aaccccgatc 


agctggcgaa 


gatcaaggcg gatccgggca agaccctcgc cgcgatcgag 


640 


gaactcctgc 


ggatcttcac 


catcgcggag acggcgacct cacgcttcgc cacggcggac 


900 


gtcgagatcg 


gcggcacgct 


catccgcgcg ggtgaaggcg tcgtcggcct gagcaacgcg 


960 


ggcaaccacg 


atccggacgg 


cttcgagaac ccggacacct tcgacatcga acgcggcgcg 


1020 


cggcatcacg 


tcgccttcgg 


attcggtgtg caccaatgcc tcggccagaa cttggcgagg 


1080 


ttggaactcc 


agatcgtgtt 


cgatacgttg ttccggcgag tgccgggcat ccggatcgcc 


1140 


gtaccggtcg 


acgaactgcc 


gttcaagcac gattcgacga tctacggcct ccacgccctg 


1200 


ccggtcacct 


ggtag 




1215 



<210> 61 

<211> 404 

<212> PRT 

<213> Artificial sequence 
<220> 

<223> Synthetic 

<400> 61 

Met Thr Asp Val Glu Glu Thr Thr Ala Thr Leu Pro Leu Ala Arg Lys 
1 5 10 15 

Cys Pro Phe Ser Pro Pro Pro Glu Tyr Glu Arg Leu Arg Arg Glu Ser 
20 25 30 

Pro Val Ser Arg Val Gly Leu Pro Ser Gly Gin Thr Ala Trp Ala Leu 
35 40 45 

Thr Arg Leu Glu Asp lie Arg Glu Met Leu Ser Ser Pro His Phe Ser 
50 55 60 

Ser Asp Gin Gin Ser Pro Ser Phe Pro Leu Met Val Ala Arg Gin He 
65 70 75 80 

Arg Arg Glu Asp Lys Pro Phe Arg Pro Ser Leu Val Ala Met Asp Pro 
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85 



90 



95 



Pro Glu His Gly Lys Ala Arg Arg Asp Val Val Gly Glu Phe Thr Val 
100 105 110 



Lys Arg Met Lys Ala Leu Gin Pro Arg He Gin Gin lie Val Asp Glu 
115 120 125 



His He Asp Ala Leu Leu Ala Gly Pro Lys Pro Ala Asp Leu Val Gin 
130 135 140 



Ala Leu Ser Leu Pro Val Pro Ser Leu Val He Cys Glu Leu Leu Gly 
145 150 155 160 



Val Pro Tyr Ser Asp His Glu Phe Phe Gin Ser Cys Ser Ser Arg Met 
165 170 175 



Leu Ser Arg Glu Val Thr Ala Glu Glu Arg Met Thr Ala Phe Glu Ser 
180 185 190 



Leu Glu Asn Tyr Leu Asp Glu Leu Val Thr Lys Lys Glu Ala Asn Ala 
195 200 205 



Thr Glu Asp Asp Leu Leu Gly Arg Gin He Leu Lys Gin Arg Glu Ser 
210 215 220 



Gly Glu Ala Asp His Gly Glu Leu Val Gly Leu Ala Ala Leu Leu Leu 
225 230 235 240 



He Ala Gly His Glu Thr Thr Ala Asn Met He Ser Leu Gly Thr Val 
245 250 255 



Thr Leu Leu Glu Asn Pro Asp Gin Leu Ala Lys He Lys Ala Asp Pro 
260 265 270 



Gly Lys Thr Leu Ala Ala He Glu Glu Leu Leu Arg He Phe Thr He 
275 280 285 



Ala Glu Thr Ala Thr Ser Arg Phe Ala Thr Ala Asp Val Glu He Gly 
290 295 300 



Gly Thr Leu He Arg Ala Gly Glu Gly Val Val Gly Leu Ser Asn Ala 
305 310 315 320 
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Gly Asn His Asp Pro Asp Gly Phe Glu Asn Pro Asp Thr Phe Asp He 
325 330 335 



Glu Arg Gly Ala Arg His His Val Ala Phe Gly Phe Gly Val His Gin 
340 345 350 



Cys Leu Gly Gin Asn Leu Ala Arg Leu Glu Leu Gin He Val Phe Asp 
355 360 365 



Thr Leu Phe Arg Arg Val Pro Gly He Arg He Ala Val Pro Val Asp 
370 375 380 



Glu Leu Pro Phe Lys His Asp Ser Thr He Tyr Gly Leu His Ala Leu 
385 390 395 400 



Pro Val Thr Trp 



<210> 62 
<211> 1215 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> Synthetic 
<400> 62 

atgaccgacg tcgaggaaac caccgcgacc ttgccactgg cccgcaaatg cccgttttca 60 
ccaccgcccg aatacgagcg gctccgccgg gaaagtccgg tttcccgggt cggtctcccc 120 
tccggtcaaa ccgcttgggc gctcacccgg ctcgaagaca tccgcgaaat gctgagcagt 180 
ccgcatttca gctccgaccg gcagagtccg tcgttcccgc tgatggtggc gcggcagatc 240 
cggcgcgagg acaagccgtt ccgcccgtcc ctcgtcggga tggacccgcc ggaacacggc 300 
aaggccaggc gtgacgtcgt cggggaattc accgtcaagc gcatgaaagc gcttcagcca 360 
cgtattcagc agatcgtcga cgagcatatc gacgccctgc tcgccggccc caaacccgcc 420 
gatctcgtcc aggcgctttc cctgccggtt ccgtccttgg tgatctgcga actgctcggt 480 
gtcccctatt cggaccacga gttcttccag tcctgcagtt cccggatgct cagccgggaa 540 
gtcaccgccg aagaacggat gaccgcgttc gagtcgctcg agaactatct cgacgaactc 600 
gtcacgaaga aggaggcgaa cgccaccgag gacgacctcc tcggccgcca gatcctgaag 660 
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cagcgcgaat ccggcgaagc cgaccacggc gaactggtcg gtctggcggc gctcctgctc 720 

atcgcggggc acgagactac ggcgaacatg atctcgctcg gcacggtgac cctgctggag 780 

aaccccgatc agctggcgaa gatcaaggcg gatccgggca agaccctcgc cgcgatcgag 840 

gaactcctgc ggatcttcac catcgcggag acggcgacct cacgcttcgc cacggcggac 900 

gtcgagatcg gcggcacgct catccgcgcg ggtgaaggcg tcgtcggcct gagcaacgcg 960 

ggcaaccacg atccggacgg cttcgagaac ccggacacct tcgacatcga acgcggcgcg 1020 

cggcatcacg tcgccttcgg attcggtgtg caccaatgcc tcggccagaa cttggcgagg 1080 

ttggaactcc agaccgtgtt cgatacgttg ttccggcgag tgccgggcat ccggatcgcc 1140 

gtaccggtcg acgaactgcc gttcaagcac gattcgacga tctacggcct ccacgccctg 1200 

ccggtcacct ggtag 1215 



<210> 63 
<211> 404 
<212> PRT 

<213> Artificial sequence 
<220> 

<223> Synthetic 
<400> 63 

Met Thr Asp Val Glu Glu Thr Thr Ala Thr Leu Pro Leu Ala Arg Lys 
15 10 15 



Cys Pro Phe Ser Pro Pro Pro Glu Tyr Glu Arg Leu Arg Arg Glu Ser 
20 25 30 



Pro Val Ser Arg Val Gly Leu Pro Ser Gly Gin Thr Ala Trp Ala Leu 
35 40 45 



Thr Arg Leu Glu Asp lie Arg Glu Met Leu Ser Ser Pro His Phe Ser 
50 55 60 



Ser Asp Arg Gin Ser Pro Ser Phe Pro Leu Met Val Ala Arg Gin lie 
65 70 75 80 



Arg Arg Glu Asp Lys Pro Phe Arg Pro Ser Leu Val Gly Met Asp Pro 
85 90 "* 95 



Pro Glu His Gly Lys Ala Arg Arg Asp Val Val Gly Glu Phe Thr Val 
100 105 HO 
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Lys Arg Met Lys Ala Leu Gin Pro Arg lie Gin Gin lie Val Asp Glu 
115 120 125 



His lie Asp Ala Leu Leu Ala Gly Pro Lys Pro Ala Asp Leu Val Gin 
130 135 140 



Ala Leu Ser Leu Pro Val Pro Ser Leu Val lie Cys Glu Leu Leu Gly 
145 150 155 160 



Val Pro Tyr Ser Asp His Glu Phe Phe Gin Ser Cys Ser Ser Arg Met 
165 170 175 



Leu Ser Arg Glu Val Thr Ala Glu Glu Arg Met Thr Ala Phe Glu Ser 
180 185 190 



Leu Glu Asn Tyr Leu Asp Glu Leu Val Thr Lys Lys Glu Ala Asn Ala 
195 200 205 



Thr Glu Asp Asp Leu Leu Gly Arg Gin lie Leu Lys Gin Arg Glu Ser 
210 215 220 



Gly Glu Ala Asp His Gly Glu Leu Val Gly Leu Ala Ala Leu Leu Leu 
225 230 235 240 



lie Ala Gly His Glu Thr Thr Ala Asn Met lie Ser Leu Gly Thr Val 
245 250 255 



Thr Leu Leu Glu Asn Pro Asp Gin Leu Ala Lys lie Lys Ala Asp Pro 
260 265 270 



Gly Lys Thr Leu Ala Ala lie Glu Glu Leu Leu Arg lie Phe Thr lie 
275 280 285 



Ala Glu Thr Ala Thr Ser Arg Phe Ala Thr Ala Asp Val Glu lie Gly 
290 295 300 



Gly Thr Leu lie Arg Ala Gly Glu Gly Val Val Gly Leu Ser Asn Ala 
305 310 315 320 



Gly Asn His Asp Pro Asp Gly Phe Glu Asn Pro Asp Thr Phe Asp lie 
325 330 335 
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Glu Arg Gly Ala Arg His His Val Ala Phe Gly Phe Gly Val His Gin 
340 345 350 

Cys Leu Gly Gin Asn Leu Ala Arg Leu Glu Leu Gin Thr Val Phe Asp 
355 360 365 

Thr Leu Phe Arg Arg Val Pro Gly lie Arg lie Ala Val Pro Val Asp 
370 375 380 

Glu Leu Pro Phe Lys His Asp Ser Thr lie Tyr Gly Leu His Ala Leu 
385 390 395 400 

Pro Val Thr Trp 



<210> 64 
<211> 1215 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> Synthetic 
<400> 64 

atgaccgacg tcgaggaaac caccgcgacc ttgccactgg cccgcaaatg cccgttttca 60 

ccaccgcccg aatacgagcg gctccgccgg gaaagtccgg tttcccgggt cggtctcccc 120 

tccggtcaaa ccgcttgggc gctcacccgg ctcgaagaca tccgcgaaat gctgagcagt 180 

ccgcatttca gctccgaccg gcagagtccg tcgttcccgc tgatggtggc gcggcagatc 240 

cggcgcgagg acaagccgtt ccgcccgtcc ctcgtcgcga tggacccgcc ggaacacggc 300 

aaggccaggc gtgacgccgt cggggaattc accgtcaagc gcatgaaagc gcttcagcca 360 

cgtattcagc agatcgtcga cgagcatatc gacgccctgc tcgccggccc caaacccgcc 420 

gatctcgtcc aggcgctttc cctgccggtt ccgtccttgg tgatctgcga actgctcggt 480 

gtcccctatt cggaccacga gttcttccag tcctgcagtt cccggatgct cagccgggaa 540 

gtcaccgccg aagaacggat gaccgcgttc gagtcgctcg agaactatct cgacgaactc 600 

gtcacgaaga aggaggcgaa cgccaccgag gacgacctcc tcggccgcca gatcctgaag 660 

cagcgcgaat ccggcgaagc cgaccacggc gaactggtcg gtctggcggc gctcctgctc 720 

atcgcggggc acgagactac ggcgaacatg atctcgctcg gcacggtgac cctgctggag 780 

aaccccgatc agctggcgaa gatcaaggca gatccgggca agaccctcgc cgcgatcgag 840 
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gaactcctgc 


ggatcttcac 


catcgcggag acggcgacct cacgcttcgc cacggcggac 


900 


gtcgagatcg 


gcggcacgct 


catccgcgcg ggtgaaggcg tcgtcggcct gagcaacgcg 


960 


ggcaaccacg 


atccggacgg 


cttcgagaac ccggacacct tcgacatcga acgcggcgcg 


1020 


cggcatcacg 


tcgccttcgg 


attcggtgtg caccaatgcc tcggccagaa cttggcgagg 


1080 


ttggaactcc 


agatcgtgtt 


cgatacgttg ttccggcgag tgccgggcat ccggatcgcc 


1140 


gtaccggtcg 


acgaactgcc 


gttcaagcac gattcgaega tctacggcct ccacgccctg 


1200 


ccggtcacct 


ggtag 




1215 



<210> 


65 


<211> 


404 


<212> 


PRT 


<213> 


Artificial 


<220> 




<223> 


Synthetic 


<400> 


65 



Met Thr Asp Val Glu Glu Thr Thr Ala Thr Leu Pro Leu Ala Arg Lys 
15 10 15 



Cys Pro Phe Ser Pro Pro Pro Glu Tyr Glu Arg Leu Arg Arg Glu Ser 
20 25 30 



Pro Val Ser Arg Val Gly Leu Pro Ser Gly Gin Thr Ala Trp Ala Leu 
35 40 45 



Thr Arg Leu Glu Asp He Arg Glu Met Leu Ser Ser Pro His Phe Ser 
50 55 60 



Ser Asp Arg Gin Ser Pro Ser Phe Pro Leu Met Val Ala Arg Gin He 
65 70 75 80 



Arg Arg Glu Asp Lys Pro Phe Arg Pro Ser Leu Val Ala Met Asp Pro 
85 90 95 



Pro Glu His Gly Lys Ala Arg Arg Asp Ala Val Gly Glu Phe Thr Val 
100 105 110 



Lys Arg Met Lys Ala Leu Gin Pro Arg He Gin Gin He Val Asp Glu 
115 120 125 
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His lie Asp Ala Leu Leu Ala Gly Pro Lys Pro Ala Asp Leu Val Gin 
130 135 " 140 



Ala Leu Ser Leu Pro Val Pro Ser Leu Val He Cys Glu Leu Leu Gly 
145 150 155 160 



Val Pro Tyr Ser Asp His Glu Phe Phe Gin Ser Cys Ser Ser Arg Met 
165 170 175 



Leu Ser Arg Glu Val Thr Ala Glu Glu Arg Met Thr Ala Phe Glu Ser 
180 185 190 



Leu Glu Asn Tyr Leu Asp Glu Leu Val Thr Lys Lys Glu Ala Asn Ala 
195 200 " 205 



Thr Glu Asp Asp Leu Leu Gly Arg Gin He Leu Lys Gin Arg Glu Ser 
210 215 220 



Gly Glu Ala Asp His Gly Glu Leu Val Gly Leu Ala Ala Leu Leu Leu 
225 230 235 240 



He Ala Gly His Glu Thr Thr Ala Asn Met He Ser Leu Gly Thr Val 
245 250 255 



Thr Leu Leu Glu Asn Pro Asp Gin Leu Ala Lys He Lys Ala Asp Pro 
260 265 270 



Gly Lys Thr Leu Ala Ala He Glu Glu Leu Leu Arg He Phe Thr He 
275 280 285 



Ala Glu Thr Ala Thr Ser Arg Phe Ala Thr Ala Asp Val Glu He Gly 
290 295 300 



Gly Thr Leu He Arg Ala Gly Glu Gly Val Val Gly Leu Ser Asn Ala 
305 310 315 320 



Gly Asn His Asp Pro Asp Gly Phe Glu Asn Pro Asp Thr Phe Asp He 
325 330 335 



Glu Arg Gly Ala Arg His His Val Ala Phe Gly Phe Gly Val His Gin 
340 345 350 
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Cys lieu Gly Gin Asn Leu Ala Arg Leu Glu Leu Gin lie Val Phe Asp 
355 360 365 

Thr Leu Phe Arg Arg Val Pro Gly lie Arg lie Ala Val Pro Val Asp 
370 375 380 

Glu Leu Pro Phe Lys His Asp Ser Thr lie Tyr Gly Leu His Ala Leu 
385 390 395 400 

Pro Val Thr Trp 

<210> 66 

<211> 1215 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Synthetic 

<400> 66 



atgaccgacg 


tcgaggaaac 


caccgcgacc 


ttgccactgg 


ctcgcaaatg 


cccgttttca 


60 


ccaccgcccg 


aatacgagcg 


gctccgccgg 


gaaagtccgg 


tttcccgggt 


cggtctcccc 


120 


tccggtcaaa 


ccgcttgggc 


gctcacccgg 


ctcgaagaca 


tccgcgaaat 


gctgagcagt 


180 


ccgcatttca 


gctccgaccg 


gcagagtccg 


tcgttcccgc 


tgatggtggc 


gcggcagatc 


240 


cggcgcgagg 


acaagccgtt 


ccacccgtcc 


ctcgtcgcga 


tggacccgcc 


ggaacacggc 


300 


aaggccaggc 


gtgacgtcgt 


cggggaattc 


accgtcaagc 


gcatgaaagc 


gcttcagcca 


360 


cgtattcagc 


agatcgtcga 


cgagcatatc 


gacgccctgc 


tcgccggccc 


caaacccgcc 


420 


gatctcgtcc 


aggcgctttc 


cctgccggtt 


ccgtccttgg 


tgatctgcga 


actgctcggt 


480 


gtcccctatt 


cggaccacga 


gttcttccag 


tcctgcagtt 


cccggatgct 


cagccgggaa 


540 


gtcaccgccg 


aagaacggat 


gaccgcgttc 


gagtcgctcg 


agaactatct 


cgacgaactc 


600 


gtcacgaaga 


aggaggcgaa 


cgccaccgag 


gacgacctcc 


tcggccgcca 


gatcctgaag 


660 


cagcgcgaat 


ccggcgaagc 


cgaccacggc 


gaactggtcg 


gtctggcggc 


gctcctgctc 


720 


atcgcggggc 


acgagactac 


ggcgaacatg 


atctcgctcg 


gcacggtgac 


cctgctggag 


780 


aaccccgatc 


agctggcgaa 


gatcaaggcg 


gatccgggca 


agaccctcgc 


cgcgatcgag 


840 


gaactcctgc 


ggatcttcac 


catcgcggag 


acggcgacct 


cacgcttcgc 


cacggcggac 


900 


gtcgagatcg 


gcggcacgct 


catccgcgcg 


ggtgaaggcg 


tcgtcggcct 


gagcaacgcg 


960 
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J 

ggcaaccacg atccggacgg cttcgagaac ccggacacct tcgacatcga acgcggcgcg 1020 

cggcatcacg tcgccttcgg attcggtgtg caccaatgcc tcggccagaa cttggcgagg 1080 

ttggaactcc agatcgtgtt cgatacgttg ttccggcgag tgccgggcat ccggatcgcc 1140 

gtaccggtcg acgaactgcc gttcaagcac gattcgacga tctacggcct ccacgccctg 1200 

ccggtcacct ggtag 1215 



<210> 67 
<211> 404 
<212> PRT 

<213> Artificial sequence 
<220> 

<2 2 3 > Synthetic 
<400> 67 

Met Thr Asp Val Glu Glu Thr Thr Ala Thr Leu Pro Leu Ala Arg Lys 
15 10 15 



Cys Pro Phe Ser Pro Pro Pro Glu Tyr Glu Arg Leu Arg Arg Glu Ser 
20 25 30 



Pro Val Ser Arg Val Gly Leu Pro Ser Gly Gin Thr Ala Trp Ala Leu 
35 40 45 



Thr Arg Leu Glu Asp lie Arg Glu Met Leu Ser Ser Pro His Phe Ser 
50 55 60 



Ser Asp Arg Gin Ser Pro Ser Phe Pro Leu Met Val Ala Arg Gin He 
65 70 75 80 



Arg Arg Glu Asp Lys Pro Phe His Pro Ser Leu Val Ala Met Asp Pro 
85 90 95 



Pro Glu His Gly Lys Ala Arg Arg Asp Val Val Gly Glu Phe Thr Val 
100 105 110 



Lys Arg Met Lys Ala Leu Gin Pro Arg He Gin Gin He Val Asp Glu 
115 120 125 



His He Asp Ala Leu Leu Ala Gly Pro Lys Pro Ala Asp Leu Val Gin 
130 135 140 
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Ala Leu Ser Leu Pro Val Pro Ser Leu Val lie Cys Glu Leu Leu Gly 
145 150 155 160 



Val Pro Tyr Ser Asp His Glu Phe Phe Gin Ser Cys Ser Ser Arg Met 
165 170 175 



Leu Ser Arg Glu Val Thr Ala Glu Glu Arg Met Thr Ala Phe Glu Ser 
180 185 190 



Leu Glu Asn Tyr Leu Asp Glu Leu Val Thr Lys Lys Glu Ala Asn Ala 
195 200 205 



Thr Glu Asp Asp Leu Leu Gly Arg Gin lie Leu Lys Gin Arg Glu Ser 
210 215 220 



Gly Glu Ala Asp His Gly Glu Leu Val Gly Leu Ala Ala Leu Leu Leu 
225 230 235 240 



He Ala Gly His Glu Thr Thr Ala Asn Met He Ser Leu Gly Thr Val 
245 250 255 



Thr Leu Leu Glu Asn Pro Asp Gin Leu Ala Lys lie Lys Ala Asp Pro 
260 265 270 



Gly Lys Thr Leu Ala Ala He Glu Glu Leu Leu Arg He Phe Thr He 
275 280 285 



Ala Glu Thr Ala Thr Ser Arg Phe Ala Thr Ala Asp Val Glu lie Gly 
290 295 300 



Gly Thr Leu He Arg Ala Gly Glu Gly Val Val Gly Leu Ser Asn Ala 
305 310 315 320 



Gly Asn His Asp Pro Asp Gly Phe Glu Asn Pro Asp Thr Phe Asp He 
325 330 335 



Glu Arg Gly Ala Arg His His Val Ala Phe Gly Phe Gly Val His Gin 
340 345 350 



Cys Leu Gly Gin Asn Leu Ala Arg Leu Glu Leu Gin He Val Phe Asp 
355 360 365 



Thr Leu Phe Arg Arg Val Pro Gly He Arg He Ala Val Pro Val Asp 
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370 375 380 

Glu Leu Pro Phe Lys His Asp Ser Thr He Tyr Gly Leu His Ala Leu 
385 390 395 400 

Pro Val Thr Trp 

<210> 68 

<211> 1215 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Synthetic 

<400> 68 



atgaccgacg 


tcgaggaaac 


caccgcgacc 


ttgccactgg 


cccgcaaatg 


cccgttttca 


60 


ccaccgcccg 


aatacgagcg 


gctccgccgg 


aaaagtccgg 


tttcccgggt 


cggtctcccc 


120 


tccggtcaaa 


ccgcttgggc 


gctcacccgg 


ctcgaagaca 


tccgcgaaat 


gctgagcagt 


180 


ccgcatttca 


gctccgaccg 


gcagagtccg 


tcgttcccgc 


tgatggtggc 


gcggcagatc 


240 


cggcgcgagg 


acaagccgtt 


ccgcccgtcc 


ctcatcgcga 


tggacccgcc 


ggaacacggc 


300 


aaggccaggc 


gtgacgtcgt 


cggggaattc 


accgtcaagc 


gcatgaaagc 


gcttcagcca 


360 


cgtattcagc 


agatcgtcga 


cgagcatatc 


gacgccctgc 


tcgccggccc 


caaacccgcc 


420 


gatctcgtcc 


aggcgctttc 


cctgccggtt 


ccgtccttgg 


tgatctgcga 


actgctcggt 


480 


gtcccctatt 


cggaccacga 


gttcttccag 


tcctgcagtt 


cccggatgct 


cagccgggaa 


540 


gtcaccgccg 


aagaacggat 


gaccgcgttc 


gagtcgctcg 


agaactatct 


cgacgaactc 


600 


gtcacgaaga 


aggaggcgaa 


cgccaccgag 


gacgacctcc 


tcggccgcca 


gatcctgaag 


660 


cagcgcgaat 


ccggcgaagc 


cgaccacggc 


gaactggtcg 


gtctggcgtt 


cctcctgctc 


720 


atcgcggggc 


acgagactac 


ggcgaacatg 


atctcgctcg 


gcacggtgac 


cctgctggag 


780 


aaccccgatc 


agctggcgaa 


gatcaaggcg 


gatccgggca 


agaccctcgc 


cgcgatcgag 


840 


gaactcctgc 


ggatcttcac 


catcgcggag 


acggcgacct 


cacgcttcgc 


cacggcggac 


900 


gtcgagatcg 


gcggcacgct 


catccgcgcg 


ggtgaaggcg 


tcgtcggcct 


gagcaacgcg 


960 


ggcaaccacg 


atccggacgg 


cttcgagaac 


ccggacacct 


tcgacatcga 


acgcggcgcg 


1020 


cggcatcacg 


tcgccttcgg 


attcggtgtg 


caccaatgcc 


tcggccagaa 


cttggcgagg 


1080 


ttggaactcc 


agatcgtgtt 


cgatacgttg 


ttccggcgag 


tgccgggcat 


ccggatcgcc 


1140 
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gtaccggtcg acgaactgcc gttcaagcac gattcgacga tctacggcct ccacgccctg 1200 
ccggtcacct ggtag 1215 



<210> 


69 


<211> 


404 


<212> 


PRT 


<213> 


Artificial 


<220> 




<223> 


Synthetic 


<400> 


69 



Met Thr Asp Val Glu Glu Thr Thr Ala Thr Leu Pro Leu Ala Arg Lys 
15 10 15 



Cys Pro Phe Ser Pro Pro Pro Glu Tyr Glu Arg Leu Arg Arg Lys Ser 
20 25 30 



Pro Val Ser Arg Val Gly Leu Pro Ser Gly Gin Thr Ala Trp Ala Leu 
35 40 45 



Thr Arg Leu Glu Asp lie Arg Glu Met Leu Ser Ser Pro His Phe Ser 
50 55 60 



Ser Asp Arg Gin Ser Pro Ser Phe Pro Leu Met Val Ala Arg Gin He 
65 70 75 80 



Arg Arg Glu Asp Lys Pro Phe Arg Pro Ser Leu He Ala Met Asp Pro 
85 90 95 



Pro Glu His Gly Lys Ala Arg Arg Asp Val Val Gly Glu Phe Thr Val 
100 105 110 



Lys Arg Met Lys Ala Leu Gin Pro Arg He Gin Gin He Val Asp Glu 
115 120 125 



His He Asp Ala Leu Leu Ala Gly Pro Lys Pro Ala Asp Leu Val Gin 
130 135 140 



Ala Leu Ser Leu Pro Val Pro Ser Leu Val He Cys Glu Leu Leu Gly 
145 150 155 160 



Val Pro Tyr Ser Asp His Glu Phe Phe Gin Ser Cys Ser Ser Arg Met 
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165 



170 



175 



Leu Ser Arg Glu Val Thr Ala Glu Glu Arg Met Thr Ala Phe Glu Ser 
180 185 190 



lieu Glu Asn Tyr Leu Asp Glu Leu Val Thr Lys Lys Glu Ala Asn Ala 
195 200 205 



Thr Glu Asp Asp Leu Leu Gly Arg Gin lie Leu Lys Gin Arg Glu Ser 
210 215 220 



Gly Glu Ala Asp His Gly Glu Leu Val Gly Leu Ala Phe Leu Leu Leu 
225 230 235 240 



lie Ala Gly His Glu Thr Thr Ala Asn Met He Ser Leu Gly Thr Val 
245 250 255 



Thr Leu Leu Glu Asn Pro Asp Gin Leu Ala Lys He Lys Ala Asp Pro 
260 265 270 



Gly Lys Thr Leu Ala Ala He Glu Glu Leu Leu Arg He Phe Thr He 
275 280 285 



Ala Glu Thr Ala Thr Ser Arg Phe Ala Thr Ala Asp Val Glu He Gly 
290 295 300 



Gly Thr Leu He Arg Ala Gly Glu Gly Val Val Gly Leu Ser Asn Ala 
305 310 315 320 



Gly Asn His Asp Pro Asp Gly Phe Glu Asn Pro Asp Thr Phe Asp He 
325 330 335 



Glu Arg Gly Ala Arg His His Val Ala Phe Gly Phe Gly Val His Gin 
340 345 A 350 



Cys Leu Gly Gin Asn Leu Ala Arg Leu Glu Leu Gin He Val Phe Asp 
355 360 365 



Thr Leii Phe Arg Arg Val Pro Gly He Arg He Ala Val Pro Val Asp 
370 375 380 



Glu Leu Pro Phe Lys His Asp Ser Thr He Tyr Gly Leu His Ala Leu 
385 390 395 400 
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Pro Val Thr Trp 



<210> 


70 


<211> 


35 


<212> 


DNA 


<213> 


Artificial sequence 


<220> 




<223> 


Synthetic 


<220> 




<221> 


mi s cofeature 


<222> 


(20)*. (21) 


<223> 


n=a, c, g or t 


<400> 


70 



gttccgcccg tccctcgtcn nsatggaccc gccgg 35 



<210> 


71 


<211> 


35 


<212> 


DNA 


<213> 


Artificial sequence 


<220> 




<223> 


Synthetic 


<220> 




<221> 


misc feature 


<222> 


(15) . . (16) 


<223> 


n=a, c, g or t 



<400> 71 

cctgcagttc ccggnnsctc agccgggaag tcacc 35 

<210> 72 
<211> 1215 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> Synthetic 
<400> 72 

atgaccgacg tcgaggaaac caccgcgacc ttgccactgg cccgcaaatg cccgttttca 60 
ccaccgcccg aatacgagcg gctccgccgg gaaagtccgg tttcccgggt cggtctcccc 120 
tccggtcaga ccgcttgggc gctcacccgg ctcgaagaca tccgcgaaat gctgagcagt 180 
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ccgcatttca gctccgacca gcagagtccg tcgttcccgc tgatggtggc gcggcagatc 240 

cggcgcgagg acaagccgtt ccgcccgtcc ctcgtcgcga tggacccgcc ggaacacggc 300 

aaggccaggc gtgacgtcgt cggggaattc accgtcaagc gcatgaaagc gcttcagcca 360 

cgtattcagc agatcgtcga cgagcatacc gacgccctgc tcgccggccc caaacccgcc 420 

gatctcgtcc aggcgctttc cctgccggtt ccgtccttgg tgatctgcga actgctcggt 480 

gtcccctatt cggaccacga gttcttccag tcctgcagtt cccgggcgct cagccgggaa 540 

gtcaccgccg aagaacggat gaccgcgttc gagtcgctcg agaactatct cgacgaactc 600 

gtcacgaaga aggaggcgaa cgccaccgag gacgacctcc tcggccgcca gatcctgaag 660 

cagcgcgaat ccggcgaagc cgaccacggc gaactggtcg gtctggcggc gctcctgctc 720 

atcgcggggc acgagactac ggcgaacatg atctcgctcg gcacggtgac cctgctggag 780 

aaccccgatc agctggcgaa gatcaaggcg gacccgggca agaccctcgc cgcgatcgag 840 

gaactcctgc ggatcttcac catcgcggag acggcgacct cacgcttcgc cacggeggac 900 

gtcgagatcg gcggcacgct catccgcgcg ggtgaaggcg tcgtcggcct gagcaacgcg 960 

ggcaaccacg atccggacgg cttcgagaac ccggacacct tcgacatcga acgcggcgcg 1020 

cggcatcacg tcgccttcgg attcggtgtg caccaatgcc tcggccagaa cttggcgagg 1080 

ttggaactcc agatcgtgtt cgatacgttg ttccggcgag tgccgggcat ccggatcgcc 1140 

gtaccggtcg acgaactgcc gttcaagcac gattcgacga tctacggcct ccacgccctg 1200 

ccggtcacct ggtag 1215 

<210> 73 
<211> 404 
<212> PRT 

<213> Artificial sequence 
<220> 

<223> Synthetic 
<400> 73 

Met Thr Asp Val Glu Glu Thr Thr Ala Thr Leu Pro Leu Ala Arg Lys 



Cys Pro Phe Ser Pro Pro Pro Glu Tyr Glu Arg Leu Arg Arg Glu Ser 
20 25 30 

Pro Val Ser Arg Val Gly Leu Pro Ser Gly Gin Thr Ala Trp Ala Leu 
35 40 45 
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Thr Arg Leu Glu Asp He Arg Glu Met Leu Ser Ser Pro His Phe Ser 
50 55 60 



Ser Asp Gin Gin Ser Pro Ser Phe Pro Leu Met Val Ala Arg Gin He 
65 70 75 80 



Arg Arg Glu Asp Lys Pro Phe Arg Pro Ser Leu Val Ala Met Asp Pro 
85 90 95 



Pro Glu His Gly Lys Ala Arg Arg Asp Val Val Gly Glu Phe Thr Val 
100 105 110 



Lys Arg Met Lys Ala Leu Gin Pro Arg He Gin Gin He Val Asp Glu 
115 120 125 



His Thr Asp Ala Leu Leu Ala Gly Pro Lys Pro Ala Asp Leu Val Gin 
130 135 140 



Ala Leu Ser Leu Pro Val Pro Ser Leu Val He Cys Glu Leu lieu Gly 
145 150 155 " 160 



Val Pro Tyr Ser Asp His Glu Phe Phe Gin Ser Cys Ser Ser Arg Ala 
165 170 175 



Leu Ser Arg Glu Val Thr Ala Glu Glu Arg Met Thr Ala Phe Glu Ser 
180 185 190 



Leu Glu Asn Tyr Leu Asp Glu Leu Val Thr Lys Lys Glu Ala Asn Ala 
195 200 205 



Thr Glu Asp Asp Leu Leu Gly Arg Gin He Leu Lys Gin Arg Glu Ser 
210 215 220 



Gly Glu Ala Asp His Gly Glu Leu Val Gly Leu Ala Ala Leu Leu Leu 
225 230 235 240 



He Ala Gly His Glu Thr Thr Ala Asn Met He Ser Leu Gly Thr Val 
245 250 255 



Thr Leu Leu Glu Asn Pro Asp Gin Leu Ala Lys He Lys Ala Asp Pro 
260 265 270 
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Gly Lys Thr Leu Ala Ala He Glu Glu Leu Leu Arg He Phe Thr He 
275 280 285 



Ala Glu Thr Ala Thr Ser Arg Phe Ala Thr Ala Asp Val Glu He Gly 
290 295 300 



Gly Thr Leu He Arg Ala Gly Glu Gly Val Val Gly Leu Ser Asn Ala 
305 310 315 320 



Gly Asn His Asp Pro Asp Gly Phe Glu Asn Pro Asp Thr Phe Asp He 
325 330 335 



Glu Arg Gly Ala Arg His His Val Ala Phe Gly Phe Gly Val His Gin 
340 345 350 



Cys Leu Gly Gin Asn Leu Ala Arg Leu Glu Leu Gin lie Val Phe Asp 
355 360 365 



Thr Leu Phe Arg Arg Val Pro Gly He Arg He Ala Val Pro Val Asp 
370 375 380 



Glu Leu Pro Phe Lys His Asp Ser Thr He Tyr Gly Leu His Ala Leu 
385 390 395 400 



Pro Val Thr Trp 



<210> 74 
<211> 1215 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> Synthetic 
<400> 74 

atgaccgacg tcgaggaaac caccgcgacc ttgccactgg cccgcaaatg cccgttttca 60 
ccaccgcccg aatacgagcg gctccgccgg gaaagtccgg tttcccgggt cggtctcccc 120 
tccggtcaaa ccgcttgggc gctcacccgg ctcgaagaca tccgcgaaat gctgagcagt 180 
ccgcatttca gctccgacca gcagagtccg tcgttcccgc tgatggtggc gcggcagatc 240 
cggcgcgagg acaagccgtt ccgcccgtcc ctcgtcgcga tggacccgcc ggaacacggc 300 
aaggccaggc gtgacgtcgt cggggaattc accgtcaagc gcatgaaggc gcttcagcca 360 
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cgtattcagc agatcgtcga cgagcatatc gacgccctgc tcgccggccc caaacccacc 420 

gatctcgtcc aggcgctttc cctgccggtt ccgtccttgg tgatctgcga actgctcggt 480 

gtcccctatt cggaccacga gttcttccag tcctgcagtt cccggtcgct cagccgggaa 540 

gtcaccgccg aagaacggat gaccgcgttc gagtcgctcg agaactatct cgacgaactc 600 

gtcacgaaga aggaggcgaa cgccaccgag gacgacctcc tcggccgcca gatcctgaag 660 

cagcgcgaat ccggcgaagc cgaccacggc gaactggtcg gtctggcggc gctcctgctc 720 

atcgcggggc acgagactac ggcgaacatg atctcgctcg gcacggtgac cctgctggag 780 

aaccccgatc agctggcgaa gatcaaggcg gacccgggca agaccctcgc cgcgatcgag 840 

gaactcctgc ggatcttcac catcgcggag acggcgacct cacgcttcgc cacggcggac 900 

gtcgagatcg gcggcacgct catccgcgcg ggtgaaggcg tcgtcggcct gagcaacgcg 960 

ggcaaccacg atccggacgg cttcgagaac ccggacacct tcgacatcga acgcggcgcg 1020 

cggcatcacg tcgccttcgg attcggtgtg caccaatgcc tcggccagaa cttggcgagg 1080 

ttggaactcc agatcgtgtt cgatacgttg ttccggcgag tgccgggcat ccggatcgcc 1140 

gtaccggtcg acgaactgcc gttcaagcac gattcgacga tctacggcct ccacgccctg 1200 

ccggtcacct ggtag 1215 



<210> 75 
<211> 404 
<212> PRT 

<213> Artificial sequence 
<220> 

<223> Synthetic 
<400> 75 

Met Thr Asp Val Glu Glu Thr Thr Ala Thr Leu Pro Leu Ala Arg Lys 
15 10 15 



Cys Pro Phe Ser Pro Pro Pro Glu Tyr Glu Arg Leu Arg Arg Glu Ser 
20 25 30 



Pro Val Ser Arg Val Gly Leu Pro Ser Gly Gin Thr Ala Trp Ala Leu 
35 40 45 



Thr Arg Leu Glu Asp lie Arg Glu Met Leu Ser Ser Pro His Phe Ser 
50 - 55 60 
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Ser Asp Gin Gin Ser Pro Ser Phe Pro Leu Met Val Ala Arg Gin He 
65 70 75 80 



Arg Arg Glu Asp Lys Pro Phe Arg Pro Ser Leu Val Ala Met Asp Pro 
85 90 95 



Pro Glu His Gly Lys Ala Arg Arg Asp Val Val Gly Glu Phe Thr Val 
100 105 110 



Lys Arg Met Lys Ala Leu Gin Pro Arg He Gin Gin He Val Asp Glu 
115 120 125 



His He Asp Ala Leu Leu Ala Gly Pro Lys Pro Thr Asp Leu Val Gin 
130 135 140 



Ala Leu Ser Leu Pro Val Pro Ser Leu Val He Cys Glu Leu Leu Gly 
145 150 155 160 



Val Pro Tyr Ser Asp His Glu Phe Phe Gin Ser Cys Ser Ser Arg Ser 
165 170 175 



Leu Ser Arg Glu Val Thr Ala Glu Glu Arg Met Thr Ala Phe Glu Ser 
180 185 190 



Leu Glu Asn Tyr Leu Asp Glu Leu Val Thr Lys Lys Glu Ala Asn Ala 
195 200 205 



Thr Glu Asp Asp Leu Leu Gly Arg Gin lie Leu Lys Gin Arg Glu Ser 
210 215 220 



Gly Glu Ala Asp His Gly Glu Leu Val Gly Leu Ala Ala Leu Leu Leu 
225 230 235 240 



He Ala Gly His Glu Thr Thr Ala Asn Met He Ser Leu Gly Thr Val 
245 250 255 



Thr Leu Leu Glu Asn Pro Asp Gin Leu Ala Lys He Lys Ala Asp Pro 
260 265 270 



Gly Lys Thr Leu Ala Ala He Glu Glu Leu Leu Arg He Phe Thr He 
275 280 285 
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Ala Glu Thr Ala Thr Ser Arg Phe Ala Thr Ala Asp Val Glu He Gly 
290 295 300 



Gly Thr Leu He Arg Ala Gly- Glu Gly Val Val Gly Leu Ser Asn Ala 
305 310 315 320 



Gly Asn His Asp Pro Asp Gly Phe Glu Asn Pro Asp Thr Phe Asp He 
325 330 335 



Glu Arg Gly Ala Arg His His Val Ala Phe Gly Phe Gly Val His Gin 
340 345 350 



Cys Leu Gly Gin Asn Leu Ala Arg Leu Glu Leu Gin He Val Phe Asp 
355 360 365 



Thr Leu Phe Arg Arg Val Pro Gly He Arg He Ala Val Pro Val Asp 
370 375 380 



Glu Leu Pro Phe Lys His Asp Ser Thr He Tyr Gly Leu His Ala Leu 
385 390 395 400 



Pro Val Thr Trp 



<210> 76 

<211> 404 

<212> PRT 

<213> Saccharopolyspora erythaea 

<400> 76 



Met Thr Thr Val Pro Asp Leu Glu Ser Asp Ser Phe His Val Asp Trp 
15 10 15 



Tyr Arg Thr Tyr Ala Glu Leu Arg Glu Thr Ala Pro Val Thr Pro Val 
20 25 30 



Arg Phe Leu Gly Gin Asp Ala Trp Leu Val Thr Gly Tyr Asp Glu Ala 
35 40 45 



Lys Ala Ala Leu Ser Asp Leu Arg Leu Ser Ser Asp Pro Lys Lys Lys 
50 55 60 



Tyr Pro Gly Val Glu Val Glu Phe Pro Ala Tyr Leu Gly Phe Pro Glu 
65 70 75 80 
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Asp Val Arg Asn Tyr Phe Ala Thr Asn Met Gly Thr Ser Asp Pro Pro 
85 90 95 



Thr His Thr Arg Leu Arg Lys Leu Val Ser Gin Glu Phe Thr Val Arg 
100 105 110 



Arg Val Glu Ala Met Arg Pro Arg Val Glu Gin He Thr Ala Glu Leu 
115 120 125 



Leu Asp Glu Val Gly Asp Ser Gly Val Val Asp He Val Asp Arg Phe 
130 135 140 



Ala His Pro Leu Pro He Lys Val He Cys Glu Leu Leu Gly Val Asp 
145 150 155 160 



Glu Lys Tyr Arg Gly Glu Phe Gly Arg Trp Ser Ser Glu He Leu Val 
165 170 175 



Met Asp Pro Glu Arg Ala Glu Gin Arg Gly Gin Ala Ala Arg Glu Val 
180 185 190 



Val Asn Phe He Leu Asp Leu Val Glu Arg Arg Arg Thr Glu Pro Gly 
195 200 205 



Asp Asp Leu Leu Ser Ala Leu He Arg Val Gin Asp Asp Asp Asp Gly 
210 215 220 



Arg Leu Ser Ala Asp Glu Leu Thr Ser He Ala Leu Val Leu Leu Leu 
225 230 235 240 



Ala Gly Phe Glu Ala Ser Val Ser Leu He Gly He Gly Thr Tyr Leu 
245 250 255 



Leu Leu Thr His Pro Asp Gin Leu Ala Leu Val Arg Arg Asp Pro Ser 
260 265 270 



Ala Leu Pro Asn Ala Val Glu Glu He Leu Arg Tyr lie Ala Pro Pro 
275 280 285 



Glu Thr Thr Thr Arg Phe Ala Ala Glu Glu Val Glu He Gly Gly Val 
290 295 300 

\ 
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Ala lie Pro Gin Tyr Ser Thr Val Leu Val Ala Asn Gly Ala Ala Asn 
305 310 315 320 



Arg Asp Pro Lys Gin Phe Pro Asp Pro His Arg Phe Asp Val Thr Arg 
325 330 335 



Asp Thr Arg Gly His Leu Ser Phe Gly Gin Gly lie His Phe Cys Met 
340 345 350 



Gly Arg Pro Leu Ala Lys Leu Glu Gly Glu Val Ala Leu Arg Ala Leu" 
355 360 365 



Phe Gly Arg Phe Pro Ala Leu Ser Leu Gly lie Asp Ala Asp Asp Val 
370 375 380 



Val Trp Arg Arg Ser Leu Leu Leu Arg Gly lie Asp His Leu Pro Val 
385 390 395 400 



Arg Leu Asp Gly 



j 
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